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LECTURE NOTES 



MICHAEL CWIKEL, YORAM SAGHER AND PAVEL SHVARTSMAN 

Abstract. We develop some techniques for studying various versions of the function space 
BMO. Special cases of one of our results give alternative proofs of the celebrated John- 
Nirenberg inequality and of related inequalities due to John and to Wik. Our approach 
enables us to pose a simply formulated "geometric" question, for which an affirmative 
answer would lead to a version of the John-Nirenberg inequality with dimension free con- 
stants. 

A more detailed summary of the main ideas and results of this paper can be found at 
http : //www. math. technion. ac . il/~mcwikel/bmo/CwikSaghShvaSummary . pdf 



1. Introduction. Our main question. 

We begin by inviting the reader to consider and hopefully even answer the following 
question. We will subsequently refer to it as "Question A". 

Do there exist two absolute constants r G (0, 1/2) and s > which have the following 
property ? 

For every positive integer d and for every cube Q in W 1 , whenever E + and E_ 
are two disjoint measurable subsets of Q whose d-dimensional Lebesgue mea- 
sures satisfy 

min {X(E+), A(£_)} > r\{Q \E+\EJ) , 
then there exists some cube W contained in Q for which 

min {X(W n E+), X(W n^)}> s\(W) 



We are led to consider this question because of our interest in the space BMO of functions 
of bounded mean oscillation introduced by John and Nirenberg |12| . We recall that these 
are the functions / : R d — > R which have the property that 



SU P\77^ / \f ~ fQ\ dX < 00 
Q X(Q) Jq 



where the supremum is taken over all cubes Q in R d and where /q is the average of / on 

Q. 



The research of the first and third named authors was supported by funding from the Martin and Sima 
Jelin Chair in Mathematics, by the Technion V.P.R. Fund and by the Fund for Promotion of Research at 
the Technion. 
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We will show that an affirmative answer to Question A would have very interesting con- 
sequences for the study of a remarkable property of functions of bounded mean oscillation. 
It would imply (see Theorem 19.11) that the following "dimension free" version 

(1.1) A ({x € Q : |/W - m,\ > a}) < max U, 2^1 . X(Q) . e X p (-^gj) 



of the John-Nirenberg inequality \1'2\ holds for every a > 0. It would also imply some 
slightly stronger inequalities. (Here mj is any median of the measurable function / on the 
cube Q in R d .) 

Having formulated our question, let us now state to what extent we have been able, so 
far, to answer it or to simplify it. 

For each particular value of d G N we can find numbers, r G (0, 1/2) and s > which do 
have the property sought in Question A. Furthermore we can show that their having this 
property, implies that the inequality f 1 1.11) is satisfied. 

We do not yet have an answer to Question A, because at least one of our constants r 
and s depends on d. We can take, for example, r = \/2 — 1, but, for that choice of r, we 
have only been able to obtain a value of s which depends on d, namely s = 2~ d (3 — 2\/2j • 

Regardless of whether r and s really have to depend on the dimension, it seems of interest 

(cct \ 
— —r ) on the right hand side of our 
11/ WbmoJ 

version (II. ip of the John-Nirenberg inequality, we have revealed a quite explicit connection 
between the constants C and c and a geometric property expressed by the constants r and 
s. 

It also seems of interest that the "geometric" condition sought in Question A is, more or 
less "equivalent" to an analytic condition which compares certain kinds of BMO "norms" of 
functions / on M. d with related kinds of BMO "norms" of their rearrangements /* on (0, oo). 
(The implications, in two opposite directions, which express this "sort of equivalence" are 
precisely formulated and established in Theorems 18.31 and 18.51 ) 

As we shall show in Section [TOj if Question A can be answered affirmatively in some 
special cases, then this will suffice to answer it in general. In particular it would suffice to 
give an affirmative answer in the case where the subsets E + and E_ are each finite unions 
of dyadic subcubes of Q. Thus Question A can be considered to be a combinatoric question 
as much as a geometric one. 

Our results can be expressed in more abstract terms, and they apply to other versions 
of the space BMO including the one considered by Wik [27], where cubes are replaced by 
"false cubes". 

This preliminary version of our paper is written more or less in the style of "lecture notes". 
We hope that this will make it helpful for graduate students and that experts will forgive 
us for writing more, maybe much more than they need to read about various things. We 
have attempted to find some sort of reasonable middle way between what may suit these 
two subsets of our audience by relegating quite a number of better known facts, results and 
proofs to the appendices in Section [TTJ 
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We have surely omitted references to some very pertinent papers about this topic, but 
we hope to correct at least some of our omissions in future versions. The reader is invited 
to draw our attention to any such omissions. 



2. Notation, terminology and some more introduction 

Throughout this paper d will denote a positive integer and A will denote rf-dimensional 
Lebesgue measure on M d . The value of d will always be clear from the context. When d = 1 
we will also often use the notation \E\ instead of \(E) for each measurable subset E of IBL 
By a cube in M. d we will always mean a d-dimensional closed cube with sides parallel to the 
axes. 

Definition 2.1. To save tedious repetitions of terminology, we will say that a set E is 
admissible if it is a measurable subset (i.e., a Lebesgue measurable subset) of M. d and its 
(i-dimensional Lebesgue measure satisfies < X(E) < oo. 

For each admissible set E and each measurable real valued function / whose domain of 
definition contains E , we define the mean oscillation of / on E by 

(2.1) 0{ f,E): =iMij L.jjf-c\dX. 

It is convenient to fix some notation for two other frequently used variants of the func- 
tional 0(f,E). So we set 

for every function / which is integrable on E, and where /e := -wW j E fdX. We also set 



X(E) Je ■ 

T7^ // \f(x)-f(y)\dX(x)dX(y) 

>ExE 



X(Ef 



("A" and "D" are our abbreviations for "average" and "double integral" respectively). We 
recall that the set of medians of / on E consists of all numbers cGK which satisfy 

A {{x e E : f(x) < c}) < -X(E) and A ({x G E : f(x) > c}) < -X(E) . 
This set is always non empty, and the infimum in (12.11) is attained, i.e., 
(2-2) 0(f,E) = ^-Jjf-c\dX 

whenever c is a median of / . We also recall that 

(2.3) 0(/, E) < A(f, E) < D(/, E) < 2 0(/, E) 

for all functions / which are integrable on E. For the reader's convenience we recall the 
easy proofs of these standard facts in Appendix 111.11 
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Definition 2.2. Let D be some measurable subset of M d with positive measure and let £ be 
some collection of admissible subsets E of D. We define the space BMO(D,£) to consist 
of all (equivalence classes of) measurable functions / : D — > M for which the seminorm 

(2-4) \\f\\BMO(D, £) --= 0(f,E) 

is finite. 

One may also define BMO(D,£) equivalently via either one of the seminorms 
(2-5) \\f\\^ IO(D£] :=sn V A(f,E) 



\BMO{D,E) 



or 



(2-6) \\f\C w{D£) = snpB(f,E) 

y ' EeS 

which (cf. (12. 3p ) are each equivalent to the seminorm (12.41) to within constants of equivalence 
1 and 2. 

Of course if / coincides a.e. with a constant function then ||/||sa/o(d s) = 0- The reverse 
implication may also be true for suitable choices of D and £ . In all cases the seminorm 
IN I bm OfD £) defines a norm on suitable equivalence classes of functions in BMO(D,S) 
which may, for suitable choices of D and £, be simply equivalence classes of functions 
modulo constants. 

The most frequently considered way of choosing D and £ is: 



(2.7) 



D is either Mr or some fixed cube in M. d and £ is chosen 
to be Q(D), the collection of all cubes contained in D. 



As the reader no doubt recalls, functions of the space BMO(D, £) were first introduced and 
studied by John and Nirenberg [12] for the case where D is a cube in M. d and £ = Q(D). 
The original motivation for studying these functions apparently came from John's study 
|1U| of problems in the theory of elasticity, related in particular to the concept of elastic 
strain. One of the first applications of [12J was in a paper [17J by Moser extending Harnack's 
theorem about harmonic functions to functions which are solutions of elliptic second order 
PDEs. But the space of these functions and its analogues have since turned out to also have 
many other deep properties and numerous other, sometimes quite surprising applications 
in analysis. One particularly notable example of such an application is the connection with 
H p spaces revealed in the paper [6] of Fefferman and Stein. 

The choice of D and 8 specified in (12. 7p is only one among several possible interesting 
choices, and we will list four more examples of such choices now, taking the opportunity to 
also fix our notation for them, notation which will be used thoughout the paper. In each 
of these examples we will take the set D to either be M d or some measurable subset of M. d 
with non empty interior. 



:2.8i 



£ is chosen to be T>(D), the collection of all dyadic cubes 
contained in D . 
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(2.9) 

(2.10) 

(2.11) 



£ is chosen to be 13(D), the collection of all euclidean balls 
contained in D . 

£ is chosen to be K(D), the collection of all bounded closed 
convex subsets of D which have non empty interiors. 

£ is chosen to be W(D), the collection of all special rectangles 
contained in D . 



By special rectangles we mean all those subsets of M. d which are the cartesian products 
Ii x I 2 x . . . x I d of d bounded closed intervals of positive length, where, for each j = 1 , 2, . . . . , d, 
the length \Ij\ of Ij equals either min^i^,...^ \Ik\ or 2 minj—12 .4 \Ik\ ■ Such sets, and their 
associated space BMO(M. d , W(R d )) were introduced and studied by Wik in [57]. He used 
the terminology "false cubes" for special rectangles. Below we will describe his results in 
more detail. 

Of course the seminorm || J|| BM om k(d)) I s l ar g er than any of the other seminorms 
II/IIbmo(d £)) arising from the other choices of £ listed just above, and for this reason 
it will be of less interest for us here for the particular aims of this paper. However we 
remark that a result of Nazarov, Sodin and Vol'berg ([IS] p. 13 and |19] ) shows that every 
polynomial P : M. d — > R of degree n satisfies 

4 + log 4 

(2-12) 1 1 1°S \P\\\BMO(D,K{R d )) - 2 U ' 

It is remarkable that there is no dependence on the dimension d in this inequality. We are 
naturally led to ask whether the left side of (I2.12p can also be bounded from below by cn 
for some absolute positive constant c. If this can be shown to be the case, then other results 
in [T5] would imply that a dimension free version of John-Nirenberg inequality holds, for 
D = M. d and £ = /C(IR d ), at least for all functions of the special form log \P\. An analogous 
question with analogous consequences can be asked for the apparently more difficult and 
perhaps more interesting case where £ is chosen to be Q(~R d ). 

There are also other more "exotic" versions of the space BMO. But these seem to be 
quite beyond the scope of what we will study in this paper. For example, the measure 
A may be replaced by a more general measure, and, furthermore, the underlying set M. d 
may be replaced by other suitable sets. There is even a version of BMO in the setting of 
martingales. 

We have already alluded above to the following result in [12] . in fact the main result of 
that paper. Let Dbea cube in M. d and let / : D — > R be a function belonging to the space 
BMO(D, Q(D)). Then 



A ({x G D : \ f(x) -f D \>a})< BX(D) exp 



ba 



(2.13) ,JW -j,^-,.^-*,. (A) 

V ' \ \\J \\BMO(D,Q(D)) . 

for every a > , 

where B and b are constants which depend only on the dimension d. (In fact in the 
formulation of the main result (Lemma 1) of [12] the number fjj appearing on the right 
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side of (12. 13 ft is not explicitly chosen to be the average of / on D. Nor is the functional 
II / 1 1 bmo(d Q(D)) explicitly chosen for estimating the mean oscillation of /. However such 
choices are made in Lemma 1' of the same paper which is used to obtain Lemma 1.) 

Remark 2.3. The result that (I2.13P holds is an optimally strong result, in the following 
sense: Suppose that / : D — > R is an arbitrary integrable function which satisfies an 
estimate like (I2.13P for every subcube Q of D, but with some fixed constant independent of 
Q in place of ||/Hba/o(d Q(z>)V Then a simple calculation shows that / G BMO(D, Q(D)). 
Conversely, the result of |12j obviously gives us that a function / G BMO(D, Q{D)) satisifes 
an estimate like (I2.13P for every subcube Q of D. 

The inequality (12.131) . together with various generalizations and variants of it, will be at 
once our main motivation and our main interest in this paper. In fact (12. 13[) is the key 
to obtaining various other properties of BMO and has been widely studied further since 
its original discovery. The proof of (I2.13P in [12] uses a famous lemma of Calderon and 
Zygmund. Among other proofs of f)2.13p . one of the simpler ones is due to Bennett, DeVore 
and Sharpley [3] (see the remark at the end of Section 3 on p. 607 of |3]) using a covering 
lemma which appeared previously in j3]. 

Remark 2.4. It is a simple exercise to show that the constant B in (I2.13P must necessarily 
satisfy B > 1. Furthermore, versions of (12.131) have been proved in which B = 2. (See e.g., 
|24] or the results of |27] which we shall also discuss below.) 

Remark 2.5. It seems appropriate to make one specific remark regarding the differing for- 
mats of results in quite a number of papers which have followed on from [12]. These are 
papers which present other proofs of (I2.13P or similar inequalities, some of them general- 
izing to other settings and to other variants of BMO. By "similar inequality" we mean a 
version of (I2.13P which is modified in one or more of the following ways: For example the 

seminorm ||/|| BMO (d,s(d)) or W f \\f mo(d,q(d)) m ^ appear in place of ||/||£m 0(z?iC(d)) . 0r 
the set 

{x G D : \f{x) - f D \ > a} 

may be replaced by {x G D : \f(x) — fo\ > «}, or by {x G D : \f(x) — m\ > a} or 

{x G D : \f(x) — m\ > a} 

where m is a median of / on D. Our remark is this: It is quite straightforward to check 
that such changes give an essentially equivalent inequality. More precisely, if an inequality 
with any or all of these changes holds for all a > then, first of all, this implies that B > 1 
(cf. Remark l2.4p . But, furthermore, this also implies that an inequality exactly of the form 
(I2.13P holds for all a > 0, but with possibly new positive constants b' and B' replacing the 
positive constants b and B. Conversely, if (12 . 1 3[) holds for all a > 0, then each of the above 
mentioned variants of (12. 13[) also holds for all a > 0, again possibly with new constants b' 
and B' replacing b and B. Moreover, if it is necessary to change b and/or B when making 
any of these transitions from one version of ( 12 . 1 3[) to another, then the new constants V and 
B' will always satisfy 6/2 < V < b and 1 < B < B' < e b B. For the reader's convenience, 
we include the calculations justifying these implications in Appendix II 1.21 
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In this paper we begin the development of some techniques for studying BMO type 
spaces which are apparently somewhat different from those used so far. We will use them 
to give an alternative proof of a somewhat abstract result which includes, as special cases, 
both (I2.13P and an analogous result of Wik for special rectangles, which we will describe 
in a moment. Our proof will not be obviously shorter or simpler than the analogous proofs 
in [12] and |3] and \27\ and elsewhere. Nor does it give better constants than the ones 
obtained by previously published proofs. But it seems distinctly possible that, with further 
development and refinement, some elements of our approach here may be able to give new 
information about the behaviour of the constants B and b, for large values of the dimension 
d and perhaps even ultimately lead to determining whether these constants can both be 
taken to be independent of the dimension d. In Wik's analogue of the inequality ( I2.13P in 
the context (12. lip of special rectangles (see (I2.15P ). the relevant constants are independent 
of d. Our proof in the same context also gives constants independent of d, but Wik's 
constants are better than ours. (See Remark 19.31 for details.) 

Let us recall some of Wik's results more explicitly. For each measurable function / : 
M d — > M he defines ||/||' BA//0 to be the seminorm (12.61) where 8 = W(IR d ) is the collection of 
all special rectangles (or "false cubes") in M d , i.e., in our notation 

(^•l^) II/IIbMO = Wf\\BAIO{R d ,W{R d )) ■ 

He defines ||/|| BMO analogously, except that here £ is Q(M. d ), i.e., 

II/IIbMO = H/llBMO(K d ,Q(IR d )) ' 

The particular interest of the seminorm ||/||' BM q ^ es m the following inequality which is 
proved by Wik: 



(2.15) A ({x e Q : \f(x) -m Q \>a})< 2A(Q) exp 



aln2 



16 ll J \\BMO 



(We have written it here using notation slightly different from that of |27| . to make it more 
convenient for comparison with ( 12. 13ft .) This holds for every special rectangle Q, and for 
every a > and for every number mq which is a median of / on Q. As an immediate 
consequence of (I2.15P and another result comparing the seminorms ||/||' BMO and H/H^mO' 
Wik obtains a variant of ( I2.13P which (in view of considerations mentioned in the latter 
part of Remark 12.51 cf. also Lemma I11.5P implies that the original inequality (I2.13P holds 
in fact for 

In 2 , 
b = =- and B = 2e b 

32 (2 + 6 A M 



Here we list some of the features of our approach, some of which have already been 
discussed or alluded to: 

• Probably the most important feature of this paper is that it provides the framework 
for posing "Question A", whose positive resolution would, as we show, give a dimension free 
John-Nirenberg inequality. 

• The "geometrical" component of our proof (Theorem 17. 8p . or the more abstract hypoth- 
esis which can replace it, has to be applied only once in the course of proving our versions of 
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(I2.13j) (in contrast to some other known proofs of analogous results). It is difficult to claim 
that this component is any simpler than the Calderon-Zymund Lemma, or the covering 
lemma of jl] p. 202. But there is perhaps more hope for strengthening it to a dimension 
free version, than there is for doing away with the dependence on dimension in approaches 
based on either of those two lemmata. 

• In one of the decisive steps of our proof of the John-Nirenberg inequality (see Theorem 
18. 3p . we find that in some sense we have reduced our argument to the case where we only 
have to consider functions which take the three values 0, 1 and 2. 

• In some sense, we only have to consider the easy case where d = 1 and the relevant 
set D in (I2.13P is an interval and / is a non increasing right continuous function on that 
interval. The "geometrical" result of Theorem 17.81 which gives an affirmative answer to a 
"dimension dependent" version of Question A, is the tool for reducing the general case of 
functions of d variables to this easy case. 

• Instead of working with the seminorms 

Wf\\BMO{D,Q(D)) ° r Wf\\BMO(D,Q(D)) OT Wf\\BMO(D,Q(D)) 

we mainly use another functional, which we denote by ||/|| BA/0o or ||/||jgjvfom q(d))- This 
functional was introduced by John [11] and then further studied by Stromberg [25| . The 
condition sought in Question A implies an inequality of the form 

11/ IIbMOo,<t(K) — ll/ll J BA/O , s (IR d ) 

for suitable values of the parameter a and a suitable class of functions /. Conversely (see 
Theorem 18. 5p if such an inequality holds for some other appropriate value of a, then it 
implies the condition sought in Question A. 

• Our approach gives a version of the "dimension free" result of Wik [27] for special 
rectangles. 

3. Properties of non increasing rearrangements 

In this section we shall recall some properties of the non increasing rearrangements of 
measurable functions which are defined on an arbitrary measure space (Q, £,/i). Most, 
indeed probably all of these are well known. A detailed discussion of them can be found, 
for example, in [8J. Among other relevant references we mention [3] and [20j. 

For each measurable / : Q — > R, one first defines the distribution function /* : (0, oo) — > 
[0, oo] of / by 

/» = MVen:|/M|>a}). 

One can then define the non increasing rearrangement /* : (0, oo) — > [0, oo) of /, provided 
f*(a) is finite for some positive a. It is given by the formula 

f*(t)=M{a>0:f*(a)<t} 

for each t > 0. It is, roughly speaking, the right continuous "inverse" of the distribution 
function. 

Remark 3.1. In all our applications here, we will only need to consider the non increasing 
rearrangements of functions for which the set on which they are non zero has finite measure. 
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Thus the required condition about the finiteness of the distribution function will always be 
fulfilled. 

Here are the properties of the non increasing rearrangement that we will need in this 
paper. They all hold for any measurable function / : — >■ R for which < oo for some 

a > 0. Recall that we denote the one dimensional Lebesgue measure of subsets G of (0, oo) 
by \G\. 

(i) /* is non negative, non increasing and right continuous on (0, oo). 

(ii) / and /* have the same distribution functions, i.e., they satisfy 

(3.1) \{t > : /*(*) >a}|=/i({uefl: \f(u)\ > a}) for all a G [0, oo) . 

(iii) If /*(/?) < oo for some /3 > then, 

(3.2) \{t > : /*(*) >«}|=|i({u6S]: |/(w)| > a}) for all a G (/3, oo) 
and 

(3.3) |0 > : f*(t) = «}|=|i({uGfl: |/(w)| = a}) for all a G (/3, oo) 

(iv) The set {t > : /*(£) > a} is the open interval (0, f*(ce)) for each a G [0, oo). 

(v) If /i(fi) < oo, then a variant of (13. 3p holds for all a G [0, oo), namely 

(3.4) /i ({lo G f2 : |/( w )| = a}) = \{t G (0,/i(fi)) : f*(t) = a}\ . 

We refer, e.g. to [8] for proofs of properties (i) and (ii). We can easily deduce ( 13. 2 j) from 
( 13. ip with the help of a sequence of numbers {a n } ngN which satisfies /3 < a n < a n+ \ < a 
for each n and also limn^oo a n = a. We have 

|0 > : f* (t) > a n }\ = fj, ({uj G f2 : > a n }) < oo 

for each n and we can apply the contracting sequence theorem. We can then immediately 
obtain (13. 3p by subtracting (13 .ip from (13. 2p . 

Now let us check that property (iv) holds. Since /* is non increasing, the set 

{t > : f (t) > a} 

must be an interval whose left endpoint is 0. In view of property (ii) the right endpoint of 
this interval must be /*(«). If this interval is unbounded then of course it is open. If it 
is bounded and if a > then the continuity from the right of /* implies that this interval 
cannot contain its right endpoint. If a = then the interval is the union of the sequence 
of open intervals {t > : /*(£) > 1/^}, n = 1, 2, .... and is therefore also open. 
It follows from (iv), and the fact that /*(«) < //(fi) for all a G [0, oo), that 

(3.5) {t > : f*(t) >a} = (0, /i(O)) D {i > : /* (t) > a} = {t G (0, /x(ft)) : f*(t) > a} 
also holds for every a G [0, oo) . 
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Here now is the proof of (v). We have /x(fi) < oo, which enables us to obtain (13. 4 ft in the 
case where a = 0, by first using (13. ip and then (13. 5p . as follows: 

^({wGfi:|/(w)| = 0}) = //(fi)-//({w£fi:|/(w)|>0}) 

= |(0 lA *(n))|-|{*>0:r(t)>0}| 

= \(o,n(n))\-\{te(o,nm--f*(t)>o}\ 

= |{f e(0 )A i(n)): /•(*) = 0}| . 

We easily obtain ( I3.4p in the remaining case where a > by first applying (13.31) using 
P — 0, and then observing that in this case 

{t > : /*(*) = a} C {t > : /*(*) > 0} C (0, • 
We close this section with the following lemma which will be needed in Section HI 

Lemma 3.2. Let Q be an admissible subset ofM. d . Let g : Q — >■ [0, oo) fre a measurable 
function. Then, 

\{t £ (0,A(Q)) : \g*(t)-c\ <a}\ = \({x £ Q : \g(x) — c\ < a}) 

(3.6) 

for all c G K. and a// a > . 

Proof. The set which appears on the left side of ( 13. 6p coincides with 

{t G (0, A(Q)) : c - a < < c + a} 

and the set appearing on the right side of ( 13. 6 p coincides with 

{x G Q : c — a < o(x) < c + a} . 

If c + a < then both of these sets are empty. This is because g* is non negative by 
definition, and because we have imposed the condition that / is non negative and therefore 
so is g. Accordingly, we only have to prove ( 13.61) in the case where c + a > 0. In that case 
the above-mentioned two sets coincide respectively with {t G (0, \{Q)) '■ 7 < 9*(t) < 5} and 
{x G Q : 7 < g{x) < 5}, where 7 = max {0, c — a} and 5 = c + a. This reduces the proof 
of (13. 6p to showing that 
(3.7) 

|0 G (0,A(Q)) : 7 < g*{t) < 8}\ = \({x G Q : 7 < ^(z) < 5}) whenever < 7 < 5. 
In order to check that ( 13.71) holds we first note that 

{x G Q : 7 < 5»(x) < 5} = {x G Q : g(x) <5}\{x £ Q : g(x) < 7} 

and 

{x £ Q : g(x) < 7} C {x £ Q : g(x) < 5} . 

Therefore, 

\{x £ Q : 7 < ^?(x) < 5}| = \{x £ Q : ^?(x) < 5}| - |{a; G Q : ^?(x) < 7 }| ■ 
Analogously, we have 

|{f 6(0,A(Q)): 7 <*}l 
= \{t £ (0, X(Q)) : g*(t) < 5}\ - \{t £ (0, X(Q)) : g*(t) < 7 }| • 
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So it will suffice to show that 

(3.8) |0 G (0, A(Q)) : g*(t) < S}\ = A ({x G Q : ^(x) < 5}) 
and 

(3.9) |{t G (0, A(Q)) : g*{t) < 7 }| = A {{x G Q : <7(x) < 7 }) 
We have that 

A ({x G Q : < 5}) = A(Q) - A {{x G Q : ^(s) > 5}) . 

Then, by property (i) of non increasing rearrangements (one among several such numbered 
properties presented at the beginning of this section), since g is non negative, this last 
expression equals X(Q) — \{t > : g*(t) > S}\ which in turn equals 

X(Q)-\{te(0,X(Q)):g*(t)>5}\ = \{t G (0, X(Q)) : g*(t) < 5}\ 

which establishes (13. 8p . If 7 = then we immediately obtain (13.91) since empty sets have 
measure 0. In the remaining case, where 7 > 0, we can first use property (iii) of non 
increasing rearrangements (and again the non-negativity of g) to obtain that 

A ({x G Q : g(x) > 7}) = |0 > : g*(t) > 7 }| = \{t G (0, X(Q)) : g*(t) > 7 }| . 

This, combined with the facts that 

A ({x G Q : g(x) < 7}) = X(Q) -X({xeQ: g(x) > 7}) 

and 

|0 G (0, X(Q)) : g*(t) < 7 }| = X(Q) = \{t G (0, X(Q)) : g*(t) > 7 }| 

immediately gives us (13. 9p and so completes the proof of (|3.7j) . which, as already explained, 
also completes the proof of ( 13. 6 p and therefore of the lemma. □ 

4. The functional of John and Stromberg for characterizing BMO. 

Given an admissible subset E of M d , a real valued function / which is defined and mea- 
surable on E, and a number s G (0, 1), it is convenient to introduce the notation J(/, E, s) 
for a special functional which was introduced and studied in[TT] and then considered in 
greater generality in [25|. Thus we set 

(4.1) J(/, E, s) = inf (inf {a > : A ({x G E : \ f(x) - c| > a}) < sX(E)}) . 

c£R 

(Here, as always in this paper, A denotes ci-dimensional Lebesgue measure on M. d .) In [TTJ 
and [25J the set E is always taken to be a cube, and the functional J(/, E, s) is shown to be 
a kind of counterpart, a very useful counterpart, of the functionals 0(f,E), A(f,E) and 
D(/,£). 

There is another, perhaps more convenient formula for J(f,E,s), namely 

J(f,E,8) = m(f-c)xB) <L) (sX(E)). 

(Here u*^ denotes the left continuous rearrangement of a measurable function u.) This 
formula is mentioned e.g., in [16], |2T] and |23| . In this section we shall obtain yet another 
formula for J(/, E, s) in terms of rearrangements. (See Proposition 14. 5p . 
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In our case E will often be a cube or special rectangle, or, more generally, a member 
of some collection £ of admissible subsets which is used, as in (12. 4p . together with some 
measurable set D C R d , to define a seminorm for some version of the space BMO. Indeed 
for such £ and D, following the model of [TT| and |25| . and analogously to the seminorm 
defined by (12.41) . we consider the functional 

(4-2) \\f\\BA S }o(D, £) --= ™p J(f,E,a). 

Remark 4.1. In the case where D = R d and £ = Q(R d ) it is known [9], [23] that this quantity 
is equivalent to a certain i^-functional. More explicitly, 

\\f\\ { BMoUs) ~ /J BMO(R d , Q(R d )) . 

Despite the choice of notation in (I4.2|) . H/H^mc^d £) * s no ^ a norm nor even a seminorm. 
At least it is homogeneous, i.e., as follows almost immediately from the definition, 

(4.3) J(rf,E,s) = \r\3(f,E,s) and so \\rf\\ { BMO(D,e) = \ r \ II/IIbmcw) for each r e R - 

Let us note another simple property of these functionals: If T is an invertible affine trans- 
formation of R d , i.e., if Tx = rx + Xq for some non zero r G M and x an d if 
g(x) =f(rx + Xo), then a simple routine calculation (see Appendix 1 11. 5p shows that 

(4.4) 3(g,E,s) = 3(f,rE + x Q ,s) 

for each admissible set E contained in the domain of g. Consequently 

(4.5) \\g" {J ' s) -imi( J > s 



BMO(D,£) \\J \\BMO(T(D),T(£)) 

where T (E) is of course the collection of sets {T(E) : E G £}. In various natural examples, 
where D = R d and £ is any one of the collections Q{R d ), V(R d ), B(R d ), )C(R d ) or W(R d ) 
we of course have T(D) = D and T(£) = £ . 

Suppose that D = R d and (as in (1277)1 ) £ is the collection Q(R d ) of all cubes in R d . In 
this case it will sometimes be convenient to adopt the notation of |25| and write 



(4.6) 
and also 



(J,5) 

BMOo.s II J UBMO(SL d ,Q(R d )) 



BMO 



It is known that 
(4.7) 

S \\f\\BMOo s — \\f\\BMO — WfWsMOa . 

for every measurable / : R d — > R, whenever < s < |, where Cd is a constant depending 
only on the dimension d. This result was originally obtained by John [TT] for < s < |, and 
then extended by Stromberg |25j to include the case s — ^. Thus the functional 3(f, Q, s) 
enables one to characterize BMO functions in an alternative way. 

The result ( 14.71) is false for s > 1/2, although the definition (14. ip is valid for all s G (0, 1). 
This is because 3(f,E,s) = and ||/|| SA fOo — for certain non constant functions / 
whenever s > 1/2. (Cf. the remark on p. 522 of |25|.) 
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Remark 4.2. The essential content of ( 14. 7ft is the second inequality. Let us recall the 
elementary proof of (a more general version of) the first inequality of ( 14. 7p . By Chebyshev's 
inequality we have 

A ({x G E : \ f(x) — c\ > a}) < — [ \f - c\ dX = ^-0(f, E) 

a J E a 

for each admissible E, each a > 0, each / which is measurable on E, and each median c of 
/ on E. Thus, every a satisfying a > -0(f,E) also satisfies 

A ({x G E : \ f(x) - c| > a}) < s\(E) 

for some cel. Accordingly, 

3(f,E,s)<^0(f,E) 

which immediately implies that 

1 

s 

The first inequality in (14.71) is a special case of (jj 



( 4 - 8 ) Wf\\BMO(D,£) - " 



The method which we develop in this paper will obviously imply an alternative proof of 
(14. 7p . but (so far) only for quite small values of s. 

We will sometimes need to use the following very simple result. 

Lemma 4.3. Suppose that the function tp : R — > K. satisfies 

\<p(s)-<p(t)\ <\s-t\ 

for all s, t G R. Then 

J(<pof,E,s)<3{f,E,s) 
for every admissible set E, every s G (0, 1), and every real valued function f which is defined 
and measurable on E. 

Proof. This follows immediately from the obvious inclusion 

{{x G E : \<p (f(x)) - <f(c)\ >a}}c{xeE: \f{x) - c\ > a} 
and the definition of J(f, E,s). □ 

Remark 4.4. An analogous result holds for the usual BMO seminorm and functional 
0(f,E). See Lemma 111.61 in Appendix 111.31 

We remark that, for each / , E and s as above, and for each cGl and each a > 0, the 
condition 

A ({x G E : \f(x) -c\> a}) < s\{E) 

is equivalent to 

A ({x G E : \ f(x) -c\< a}) > (1 - s)X(E) . 

So we also have 

(4.9) 3(f, E, s) = inf (inf {a > : A ({x G E : \ f(x) - c| < a}) > (1 - s)X(E)}) . 
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The following proposition gives us another way to calculate and "visualize" J(/, E, s), at 
least for functions which are either univariate and monotone, or non negative. This other 
way, for some purposes, seems to be an easier alternative than working with the original 
definition. It enables us to work with just one variable (here denoted by u), instead of 
having to deal with the two variables a and c in the original definition. 

Proposition 4.5. (%) For each q > and each non increasing right continuous function 
h : (0, q) — » R, the formula 

(4.10) 3(h, (0, q), s) = - inf {h{u) -h(u+(l- s)q) : < u < sq)} 

holds for each s G (0, 1). 

(ii) Furthermore, the formula 

(4.11) J(/, Q, s) = l - inf {(fx Q )* («) - (fXQ)* (« + (1 - s)X(Q)) : < u < s\(Q)} 

holds for each admissible subset Q ofM. d , each s G (0, 1) and each non negative real valued 
function f which is defined and measurable on Q. 

Remark 4.6. In our main applications of this proposition the set Q will be a cube or a 
special rectangle. But we stress that, despite the choice of letter, the set Q in (14. lip can 
be an arbitrary admissible subset. 

Remark 4.7. Restated informally, part (ii) of this proposition tells us that 23(f,Q,s) is 
the "minimum" amount that {fxoT can decrease on any closed subinterval of (0, \{Q)) of 
length exactly (1 — s)X(Q). 

Remark 4.8. It is easy to see from the original definition or from the formula (14. lip , that, 
for each fixed Q and / the function s t— > 3(f,Q,s) is non increasing. As is explained in 
Appendix 111.61 (but is not needed for any other purposes in this paper), s h- >■ 3(f,Q,s) is 
also left continuous, but in general not right continuous. 

Proof of Proposition \4^[ We will first deal with part (i). (To understand the rather simple 
ideas behind our proof of (I4.10p , the reader may care to first look at the rather shorter and 
simpler proof given below in Remark 14.91 for the special case where h is strictly decreasing 
and uniformly continuous on (0,q), and to draw some relevant pictures of the graph of h.) 

Let equal the right side of f)4.10p . We will now prove one "half" of (|4.10j) . namely that 
3(h,(0,q),s) < p. Obviously /3 > and there exists a non increasing sequence {j3 n } ne ^ 
which tends to (3 and a sequence {u n } neN of numbers satisfying < u n < sq such that 

Pn = 2 ( h ( u n) ~ K u n + (1 ~ s)q)) . 

Since u n + (1 — s)q < q and h is right continuous, there exists v n such that u n + (1 — s)q < 
v n < q and 

< h {u n + (1 - s)A(Q)) - h(v n ) < - . 
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If we set c n — 2 (h(u n ) + h{v n )) and a n = \ (h(u n ) — h(v n )) then 
[un, v n ] C {t G (0, g) : h(v n ) < h(t) < h(u n )} 

= {t e (0, g) : c n - a n < h(t) < c n + a n } 
= {te(0,q):\h(t)-c n \<a n }. 

It follows that 

\{t G (0, q) : \h(t) - c n \ < a n }\ > v n - u n > (1 - s)q . 
Consequently (by (14. 9p ) we have 3(h, (0, q), s) < a n for each n. Since 

lim a n = lim /3 n — (3 

this shows that J(/i, (0, g), s) < /3. 

Next we shall prove the reverse of the preceding inequality, namely that (3 < 3(h, (0, q),s). 
Here again we will use sequences denoted by {a n } ngN , {c n } ngN , {w„} n6N and {f„} n6N . But 
they will be defined differently from their definitions in the preceding part of the proof. By 
(14. 9p . there exists a non increasing sequence {a n ,} ngN of non negative numbers which tends 
to J(h, (0, q), s) and a sequence {c n } neN of real numbers such that 

(4.12) |0 G (0, q) : c n - a n < h(t) < c n + a n }\ > (1 - s)q . 

Let us define 

u n := inf {t e (0, q) : h(t) <c n + a n } 

and 

v n := sup {t G (0, q) : h(t) > c n - a n } . 
Then, by definition, for each m G N, we have that 

[u n + 1/m, v n - 1/m] C {t G (0, q) : c n - a n < h(t) < c n + a n } C [u n , v n ] n (0, g) . 

Since we can choose m arbitrarily large, this implies that the intervals 

\Un) V n ) and [u n ,Vn\ D (0, g) 

must have the same length as the interval 

{t G (0, g) : c n - a n < h(t) < c n + a„} . 

In view of (I4.12p . this gives us that v n — u n > (1 — s)g. Furthermore, < u n < v n < q. 
Therefore, for some sufficiently small e n > 0, we have v n — u n > 2e n + (1 — s)g and 

< u n < u n + e n < u n + £ n + (1 - s)q < u n + s n + v n - u n - 2e n = v n - e n < v n < q ■ 

Since the two points u n + £ n and u n + £ n + (1 — s)q are both in (0, g), the number (5 defined 
above satisfies 

2/3 < h (u n + e n ) - h (u n + e n + (1 - s)q) 

< h(u n + E n ) -h(v n - E n ) ■ 

By the definitions of u n and v n this last expression is dominated by 

C n ~\~ Ot n (c n Oin) 2ft n . 

Thus (3 < a n for all n. This gives us the remaining required inequality f3 < 3(h, (0, q),s) 
and completes the proof of (14 . 1 f) and part (i) of the proposition. 
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Now we turn to part (ii) and the proof of the formula (14. lip . We will see that in fact 
( 14.1 ip can be deduced from (I4.10p . essentially by a careful application of the fact that the 
functions / and {fxoTi when restricted to Q and to (0, X(Q)) respectively, have the same 
distribution function. 

The function (fxo)* is non increasing and right continuous on (0, oo) and therefore also 
on the subinterval (0, X(Q)). So, we can set q = X(Q) and h = (fxo)* and apply (14.10j) to 
obtain that 

J((/Xq)%(0,A(Q)), S ) = 

(4.13) 

i inf {UxqT («) - U\qY {u + (1 - s)X(Q)) :0<u< sX(Q)} . 

We remark that we have used the notation {fxof rather than /* in (14. lip because, in 
future applications of this proposition, / might possibly be defined on all of M. d or on some 
other set which is strictly larger that Q. (Indeed the statement of the proposition explicitly 
allows for this possibility.) To simplify the notation in the rest of our proof we will let 

9 = f \q > 

i.e., g : Q — > [0, oo) will denote the function defined only on Q which is the restriction of / 
to Q. Thus we can unambiguously write g* instead of {fxoTi anc ^ °f course J(f,Q,s) = 
J(g,Q,s). In view of ( I4.13p . in order to complete the proof of (14. lip and part (ii) of 
Proposition 14. 5\ it will suffice to show that 

(4.14) J(g,Q,s) = J(g*,(0,X{Q)),s) . 

In view of (14. 9p . we can immediately obtain (I4.14p if we know that 

10 G (0, A(Q)) : \g*(t) - c| < a}\ = X ({x G Q : \g(x) - c| < a}) 

for all c G R and all a > 0. This is exactly the result which was proved in Lemma [3.21 and 
therefore the proof of part (ii) of Proposition 14.51 is complete. □ 

Remark 4.9. Here, as promised above, is the simpler proof of ( I4.10p for the case where h 
is uniformly continuous, and strictly decreasing. In this case h has a unique extension to 
a continuous function on [0, q] which we will also denote by h. For each pair of numbers 
ceR and a > 0, let 

E(c, a) := {t E [0, q] : \h{t) - c\ < a\} = {t G [0, q] : c - a < h{t) < c + a} . 

This set is clearly a closed interval [it, it + r] contained in [0, q], on which h attains a 
minimum value m (at u + r) and a maximum value M (at u), and these values both 
lie in the interval [c — a, c + a]. If we set d = \{M + m) and a' = \{M — m) then of course 
E(d, a') = E(c, a) and < a' < a. Of course the length r of the interval E(c, a) is the 
same as the length of the not necessarily closed interval {t G (0, q) : \h(t) — c| < a\}. So, in 
order to calculate J(h, (0, q),s), we have to consider all intervals E(c, a) which have length 
exceeding (1 — s)q and find the infimum of all values of a which they can have. If, as above, 
we write E(c, a) as [it, u + r], then M = h(u) and m = h(u + r) and 

1 1 

a' = -{M - m) = - (h(u) - h(u + r)) . 

_ 
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Thus J(h, (0, q), s) is the infimum of the set fl of all numbers | {h{u) — h{u + r)) for which 
r > (1 — s)q and 0<u<u + r<g. In view of the continuity and monotonicity of h, we 
can optimally choose r = (1 — s)q, so that the above infimum is equal to the infimum of 
the set Q\ of all numbers \ (h(u) — h (u + (1 — s)q)) for which (]<«<« + (1 — s)q < q. 
(The infimum is of course attained for some particular u G [0, q\.) Again by continuity, this 
infimum is also equal to the infimum of the subset f2 2 of 

^2 = |^ (h(u) -h(u+(l- s)q)) :0<u,u + (l-s)q<q 

This last fact is exactly what is expressed by the formula (I4.10p , and so completes the proof. 

We conclude this section by mentioning two more results, consequences of the formula 
(14. lip , which each apply in the "limiting" case s = 1/2 to any given admissible Q and 
to each measurable real function / defined on Q. We will not actually need to use them 
further in the current version of this paper. The first of these is the inequality 



J(/,Q,l/2) 
> - min I (fx Q )' 



(ML) _ {1XQ y , {jXQ y {MX _ {fXQ y } 

which follows from (14. lip combined with the simple observation that any closed subinter- 
val / of (0,X(Q)) of length X(Q)/2 must contain at least one of the two closed intervals 



HQ) HQ) 

4 ' 2 



and 



HQ) 3A(Q) 
2 ' 4 



For our second result we also note that, since the above in- 
terval I is closed, it must also contain the point X(Q)/2 in its interior. Consequently (14.111) 
also gives us that 

JC/,0, V2) > \ (j™Jfx Q Y w -Ux Q )' (^)) . 

5. NON INCREASING FUNCTIONS OF ONE VARIABLE IN BMO. SOME SIMPLE 

CALCULATIONS. 

Suppose that d — 1, that D is a bounded open interval, and that / : D — y R is non 
increasing and right continuous. Our main aim in this section is to prove that a slight 
variant of the John-Stromberg inequality (namely (15.71) ) holds for these very special choices 
of D and /. 

The proof of (15. 7p in this special case is of course much simpler than any known proofs 
of the John-Nirenberg or John-Stromberg inequalities for the general case. But the results 
of other sections will enable us to deduce the general case from this special case, albeit 
with not particularly good constants, and with restrictions on the range of the parameter 
s appearing in the John-Stromberg functional. 

We obtain (15 .7p as a consequence of the following two lemmata. The first of these bounds 

the functional 11/11^/0(7 Q(i)) by another functional which has been found to be useful in 
various contexts and is more or less connected to the functional sup i>0 f**(t) — f*(t) which 
was introduced in [3]. Other results about these and similar functionals can be found, for 
example, in [TJ [2j [161 122] and in a large number of subsequent papers. 
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For simplicity, we only consider (and in fact only need to consider) the interval / = (0, 1) 
at this stage. 

Lemma 5.1. Suppose that s G (0, 1/2) and p = jf-. Suppose that f : (0, 1) — > R is a non 
increasing right continuous function. Then 

sup (f(pt) - f(t)) < 2 ||/||£? o((0il) . Q((0 ,i))) • 
te(o,i/2] u ' yy "' 

Remark 5.2. Lerner [TB| Theorem 3.1, p. 52] has obtained a much more general result with 
a much more elaborate proof, which essentially implies this lemma. 

Proof. The properties of / permit us to use the formula ( 14.101) of Proposition 14.51 Let (a, b) 
be an arbitrary open subinterval of (0, 1). Via an obvious change of variables (translation, 
e.g. apply (14. 4p with r = 1 and Xq = a) the formula (I4.10p tells us that 

(5.1) J(f,(a,b),s) = -inf {f(a + u) - f (a + u + (1 - s)(b - a)) : < u < s(b - a)} . 

Let [c, d] be an arbitrary closed subinterval of (a, b) of length (1 — s)(b — a). Then d must 
satisfy 

d > a + (1 — s)(b — a) 

and c must satisfy 

c < b - (1 - s) (b - a) . 
From these estimates it follows that 

f(d) < f(a + (1 - s)(b - a)) and f(b -(l- s )(b- a)) < /(c) . 
These estimates imply that 

/(&_(! _ s ) ( 6 _ a ))_/( a + (i_ s )( 6 _ a )) < /( C ) -/(d). 

Taking the infimum over all subintervals [c, d] of (a, b) which have length (1 — s)(b — a) and 
applying (15. ip . we see that 

(5.2) / (6 - (1 - s) (b - a)) - / (a + (1 - s)(b - a)) < 2J(/, (a, b), s) < 2 ||/||&& (JiC(/)) 

whenever < a < b < 1. 

In particular, for an arbitrary t G (0,1/2], let us choose a = and b = Since 
s G (0, 1/2) we have b G (t, 2t) C (t, 1). For these choices of a and b, the left hand side of 
(15. 2 p equals f(pt) — fit). So the proof of the lemma is complete. □ 

Our second lemma enables us to bound the size of our function / by an expression de- 
pending on the functional sup tg( - l / 2 ] if{pt) — f(t) and consequently to obtain an inequality 
which is quite close to the one that we need. 

Lemma 5.3. The inequality 

(5-3) /(«) - f(v) <(l + r^f^) sup (f(pt) - f(t)) 

V log(l/p) / te(o,i/2] 

holds for every non increasing function f : (0, 1) — > M. , every p G (0, 1), and every u and v 
satisfying < u < v < 1/2. 
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As an immediate consequence we obtain 

Corollary 5.4. If f and p are as in the preceding lemma and if 

sup (f(pt)-f(t))<c, 
te (0,1/2] 

then 

(5.4) \{t G (0, 1) : f(t) - /(1/2) > a}\ < i exp f ^^M \ fgr ^ a > g . 

Proof of the lemma and its corollary. Let iV be the unique positive integer for which 
p N v < u < p^v. Then (l/p) N ~ l < I and so N < 1 + j^Jy . Hence 

/(«)-/(«) < f(p lf v)-f(fPv) = y £ i (f(jPv)-f(fr- 1 v)) 

n=l 

< N sup (f( pt )-f(t)). 

(0,1/2] 

This, combined with our estimate for iV, establishes (|5.3|) . 

Now let us prove (j5.4p under the stated hypothesis. Setting ^ = 1/2 in (15. 3p gives us 
that 

(5.5) f(u) - /(1/2) <cTl- i^j^y) ^ all u G (0, 1/2) . 

For each a > 0, the set {t G (0,1) : /(t) — /(1/2) > a} is of course an interval con- 
tained in (0,1/2]. It follows from (15. 5 j) that the length of this interval cannot exceed 

^exp(-^MiM). □ 

The preceding two lemmata and corollary have the following immediate consequence. 
Let / : I — y R be a non increasing right continuous function on the interval / = (0, 1). 
Then, for each s G (0, 1/2], 



(5.6) 



tel: fit) 



/(i 



> a 



< • exp 



a 



log (1 " 1) 



(j, s ) 

BMO(I,Q(I)). 



for all a > . 



Note that here we can also permit s to take the limiting value s — 1/2 and we can permit 
H/llsMOff q(/)) to be infinite (provided we agree to interpret both l/oo and O/oo as 0). In 
such cases the right hand side of ( 15. 6 j) is greater than or equal to 1/2 which means that 
( 15. 6 p is also true, trivially so, in these "limiting" cases. 

It will now be a very simple matter to deduce a more general version of ( 15. 6p for the case 
where I is an arbitrary open interval (a, b). Suppose that / : (a, b) — > M is right continuous 
and non increasing. Define g : (0, 1) — > R by p(£) = f{a + [b — a)t). Then (cf. (14. 4p and 

gSD) we have \\f\\ { BMO(i,Q(i)) = 11^ II sA^o((o,i),Q(o,i)) • Furthermore, the set 



E 1 = \tE(a,b):f(t)-f 



> a 
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coincides with the set (b — a)E2 + a where 

E 2 = {x G (0,l):g(x)- g(l/2) > a} . 

Therefore, applying 05. 6 p to the function g and multiplying both sides of the resulting 
inequality by b — a = \I\ gives us that 

1 — s ( aloe (- — i) 

(5.7) \{tel: ffl - f ( CJ ) > a}| < L-l . |/| . ex p ' 



2s \ 2 II f ll (J,s) 

* \\J \\BMO(I,Q(I)) , 

for every a > and every s G (0, 1/2] and for every open interval /, where cj denotes the 
midpoint of /. 

This is the inequality that we need to apply in the proof of our main result, Theorem 19. II 
of Section |9j 

One immediate consequence of ( 15. 7p together with the inequality H4.8[) recalled in Remark 
H~2l is that 



1 — s I as log f- — l) 

(5.8) |0 G I : f{t) - f ( Cl ) > a}| < _ ■ |/| • exp ' 



BMO(I,Q(I)) i 

Remark 5.5. The inequality (15. 7p is not exactly the John-Stromberg inequality, and (15. 8p 
is not exactly the John-Nirenberg inequality for the non increasing function / on the in- 
terval /, since the John-Stromberg and John-Nirenberg inequalities are for the measure 
of the sets {t G / : \f(t) — f (c/)| > a} and {t G / : \f(t) — fi\ > a} respectively, where 
fi = tjt fjf(t)dt. But a simple argument (cf. the proof of Lemma l6.ip enables us to 
replace f{x) — f(cj) by \ f(x) — f{ci)\ in the left hand side of (15. 7p . provided that we also 
multipy the right hand side by 2. The possibility of replacing /(cj) (which is of course the 
median of / on J) by // has already been discussed in Remark 12.51 This replacement can 
be made, again at the price of increasing the constant on the right hand side. (More details 
about doing this are given in Lemma 111.51 of Appendix 111.21 ) We recall that a version 
of the John-Nirenberg inequality is given in [14] in which it is shown, for d = 1, that the 
optimal value of the constant b in (I2.13P is 2/e. See also |15| p. 77. It is not difficult to 
check that the inequality obtained from (15.81) by applying these simple steps does not have 
this optimal value for b. 

6. A REDUCTION OF THE PROOF OF THE JOHN-STROMBERG THEOREM TO A SPECIAL 

CASE. 

Having, in the previous section, prepared the auxiliary results that we need about special 
functions of one variable, we now turn to consider functions of several variables. 
In our (very slightly) different notation, Lemma 3.1 on p. 517 of j25] states that 



(6.1) \({xeQ: \f(x)-m f (Q)\ > a}) < CX(Q) exp 



ca 



BMOo,, 



for all a > and s G (0, 1/2]. 

Here C and c are positive constants depending only on d and m/(Q) is a median of / on 
the arbitrary cube Q in M. d . 
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(Here we are again using the notation specified in ( 14. 6ft . Note that there is a small 
misprint in [25], namely the factor X(Q) (or \Q\) has been omitted there.) 

Our main goal in this section is to show that, in order to prove the inequality (16.11) for 
the specified values of a and s, and some other inequalities like it, it suffices to obtain such 
inequalities, but with different values of the constants c and C, in the special case where / 
is a non negative function taking only integer values. This fact will be precisely formulated 
as Theorem 16.51 (The question of whether such inequalities actually do hold in that special 
case will be deferred to Section [9j We will be able to answer it there, with the help of 
results from other sections.) 

Here we can just as easily work in the rather more general context of the space BMO(D, S) 
of Definition 12.21 Indeed, doing so will be convenient, since we will later want to apply the 
result of this section in such a general context, which will include, for example, the par- 
ticular case of special rectangles (12. lip as well as the case of usual cubes (12. 7p . Thus, 
throughout this section D will denote some arbitrary but fixed measurable subset of M. d 
and 8 will denote some arbitrary but fixed collection of admissible subsets of D. 

Our first (easy) step is to reduce everything to the case of non negative functions. 

Lemma 6.1. Let E be a fixed admissible set in S, let s be a fixed number in (0, 1/2], and 
let c and C be positive constants. Suppose that every non negative measurable function 
f : D — > [0, oo) satisfies the inequality 

(6.2) A ({x e E : f(x) > a}) < C\{E) exp ( ^ j for all a > . 

V \\f\\BMO(D,e)J 

Then every measurable function f : D — > R satisfies 



cct 

(6.3) A ({x G E : \f(x) - m\ > a}) < 2CX{E) exp I rfl 1 for all a > 



BMO(D,£) . 



\\f Ui 

whenever m is a median of f on E. 

In fact this same conclusion also holds under weaker hypotheses, namely if it is only 
known that A6.2\) holds for those non negative functions f : D — > [0, oo) having the additional 
property that 

(6.4) \({xeE:f(x)>0})< 1 -X(E). 



Remark 6.2. Our "natural" applications of Lemma [6. 1[ will be in the case where the collec- 
tion S includes the set D itself, and we choose E = D. 

Proof. We have to prove (16. 3p for an arbitrary measurable function / : D — » K with median 

m on E. Let g = f-m. Obviously J(g,E,s) = 3(f,E,s) and |M|jJj£o(£>,fi) = H^Hbmo(d,£) 
and so it will suffice to prove that 

/ cct 

(6.5) \({x e E : \g(x)\ > a}) < 2C\{E) exp [ ^ ) for all a > . 

\\9\\bmo(d,£) 
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The left hand side of ( I6.5jl equals 

A ({x G E : g(x) > a}) + A ({x G E : g(x) < -a}) 
= A ({x G E : g+(x) > a}) + A {{x G E : £-(a;) > a}) 

where, as usual, g + = max {g, 0} and g_ = g + — g = max {—g, 0}}. We can apply Lemma 
14.31 with ip(t) = max {t, 0} to obtain that 

II- ||(J,«) < ||„||( J > S ) an H ||„ ll (J ' s) < ll-nll (J ' s) 

\\y+\\BMO(D,S) — \\y\\BMO(D,£) miu 11^- H BMO(D,£) — II y\\BMO(D,£) m 

Obviously || —g\\BMO(D e) = IMIbmo(£> £) • Thus, if we apply (16. 2p to each of the non negative 
functions g + and g_ and sum the results, we obtain (16.51) and therefore that / indeed satisfies 

It remains to justify the claim in the last sentence of the statement of the lemma. Since 
m is a median of /, it follows that is a median of g and so the two functions g + and g_ 
to which we have applied (16. 2p satisfy 

A ({x G E : g+(x) > 0}) < -X(E) and A ({x G E : g_(x) > 0}) < -X(E) . 

□ 

Our next step is to reduce the proof of (16. 2\i to the case of appropriate integer valued 
functions. 

Lemma 6.3. Let E, S, s, c and C be as in the statement of Lemma \6.1\ Suppose that 
/ 1 6. fy) holds for every non negative measurable function f : D — y [0, oo) which takes only 
integer values and satisfies \\f\\^MO(D e) — an d ljo-4\ )- Then \6. jjj) also holds for every 
measurable f : D — > [0,oo), which satisfies UhM , but with the constants C and c replaced 
by c\ = c/4 and C\ = max {C, e c }. 

Remark 6.4. It will be clear from the following proof that we can also obtain the following 
additional result: Suppose that in the above lemma we impose the stronger condition that 
(16. 2 p holds also for every non negative measurable function / : D — > [0, oo) which takes only 
integer values and satisfies ||/||sMO(r>£) — but does not necessarily satisfy ( 16. 4p . Then 
we obtain the stronger conclusion that ( 16. 2 p holds for every measurable / : D — > [0,oo), 
but with the constants C and c replaced by c\ = c/4 and C\ = max{C, e c }. I.e., in this 
case we can obtain ( 16. 2 p also for functions / : D — y [0, oo) which do not satisfy ( 16. 4|) . 



, < t < 1/2 

n , n - 1/2 < t < n + 1/2 for each n G N 



Proof. We shall use the function ip : [0, oo) — > [0, oo) which is defined by 

<p(t) = 

We will need the following three obvious or easily verified properties of tp: 

V9([0,oo)) = NU{0} , 

(6.6) ip(t) - ip(s) G {0, 1} whenever 0<s<t<s + l/2 
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and 

(6.7) <p(t) > t- 1/2 for aU t > 0. 



Suppose that / : D — > [0, oo) is an arbitrary measurable function which satisfies ([6? 
and also 

(6-8) 0<\\f& ) o ( p#<l- 
The composed function ip o f also satisfies (" 16 .4[) since 

A ({x <E E : ip o f(x) > 0}) = A ({a; G £ : <pof(x) > 1}) 

= \({xeE:f(x)>l/2})< 1 -\(E). 
We will next show that, furthermore, ip o / satisfies 

(6-9) IIWIlSkcA*)^ 1 ^. 

Let W be an arbitrary set in £ . Then (16. 8p implies that J(f, W, s) < 1/3. Therefore 
(cf. (14.9P ) there exists some a G [0, 1/3) and some 7 G M such that 

(6.10) A ({x G W : |/(x) - 7| < a}) > (1 - s)A(W) . 

Let us choose 

7i = J 0(7 - «) + ¥>(7 + «)) and a i = \ (v(7 + «) - V(7 - «)) • 

Since < a < 1/3 we obtain from (16. 6 \ that «i is either or 1/2. Since ip is non decreasing, 
we also obtain that 



1 — en < f(x) < 7 + a} 
V(l ~ a ) < V f(x) < (fd + a)} 
7! - a x < <f o f(x) < 71 + ax} 
\ip o f(x) - 7i| < ax} . 



{x G W : -7| < a} = {x e W 

C {2 G W 
= {x G W 

= {xeW 

Thus we deduce, using (I6.10p and (14. 9 p once more, that 

J{ipof,W,s) <ax < 1/2 

and this establishes (16. 9p . 

Since the function tp o f is also non negative and integer valued, we have, according to 
the hypotheses of the lemma, that 

(6.11) A ({x G E : <p o f(x) > a}) < CX(E) exp ( I for all a > 



I XII ( J :S) 

\<P°f U 

The inequality ( 16.7ft implies that 



BMO{D,S) . 



{x E E : f(x) > a} C {x E E : Lf o f(x) > a - 1/2} 
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and consequently, for all a > 1/2, it follows, using ( 16. Ill) and then ( \6.9h and then ( 16. 8ft . 
that 



A {{x G E : f(x) > a}) < CX(E) exp 



c(a - 1/2) 



IV 9 ° f\\BMO(D,£). 



< t7A(£)exp(-2c(a- 1/2)) . 

If we now restrict a to the range a > 1, we also have a — 1/2 > a/2 and so 

A ({a; G E : /(x) > a}) < CX(E) exp (— ca) < C X X(E) exp (— ca) , 

recalling, as stated in the lemma, that Ci = max{C7, e c }. 

Now let us consider the range of values < a < 1. Of course 

A ({x G E : /(x) > a}) < A(£) 

for these (and all other) values of a. Furthermore, for each a G [0, 1) we have Cie _cct > 1 
and therefore 

A ({x G E : f(x) > a}) < d\(E) exp (-ca) . 
We have thus now shown that, subject to the hypotheses of the lemma, the inequality 

(6.12) A ({x G E : f(x) > P}) < C^E) exp (-c/3) 

holds for all /3 > and for all those measurable functions / : D — > [0, oo) which satisfy 

(6-13) 0<||/||^ / ) o(Di , ) <1/3 

and ( 16. Ah . Now we can easily obtain the required inequality (16 .2h without having to impose 
( I6.13p . Given an arbitrary measurable function /:£)—>■ [0, oo) satisfying ( 16 .4p and any 

a > 0, we let /= f/4 \\f\\BMO(D,e) and cnoose P = «/ 4 II/IIbmo(A£)- Then by homo g eneit y 

(cf. (14.31) ). we have / =1/4 and of course / also satisfies (16.41) . So we can apply 

BMO{D,£) 

(I6.12p to / and obtain (16. 2p . completing the proof of the lemma. □ 

We can summarize the results of this section by the following theorem, whose proof 
follows immediately from the previous two lemmata. 

Theorem 6.5. Let E be a fixed admissible set in S, let s be a fixed number in (0, 1/2], and 
let c and C be positive constants. Let $ be the set of all non negative measurable functions 
/:£)—)■ [0, oo) which take only integer values, and which satisfy \\f\\^Mo{D s) — an d 
A ({x G E : f(x) > 0}) < \\{E). Suppose that every / G $ satisfies 

(6.14) A ({x G E : f(x) > a}) < C\{E) exp j ^ J for all a > . 

V \\f\\BMO(D,£)J 

Then every measurable function / : D — > R satisfies 

l cot 

(6.15) A ({x G E : - m| > a}) < 2 max {C, e c } A(E) exp 
/or a// a > 0, whenever m is a median of f on E. 



BMO(D,£), 
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7. The "geometrical" component of our proof 

7.1. A "balancing act" between two subsets of a cube. The "bi-density" constant. 

We begin by stating a simple result which is a sort of "prototype" of the main result that 
we seek in this section. It will also be a tool for proving that main result. 

Lemma 7.1. Let Q be a cube in M, d and let E be a measurable subset of Q such that 
< X(E) < X(Q). Then there exists a cube W contained in Q such that 

(7.1) X(W \ E) = X(W n E) = -X(W) . 

We can just as easily prove a slightly more general result. See Lemma [7.51 below. Setting 
s = 1/2 in that lemma will give the result just stated here. 

If the cube Q in Lemma 17.11 is dyadic, and we want to only consider subcubes W which 
are also dyadic, then (as is made explicit below in Remark 17.31) we cannot hope in general to 
obtain one of them which satisfies ( 17. ip . Instead, as the next lemma states, we can obtain 
a dyadic subcube W satisfying a rather weaker property. 

Lemma 7.2. Let Q be a dyadic cube in M. d and let E be a measurable subset of Q such that 
< X(E) < X(Q). Then there exists a dyadic cube W contained in Q such that 

(7.2) min {X(W \ E), X(W n E)} > 2~ d (l - 2~ d )X(W) . 

Proof. Clearly any measurable set W (whether or not it is a dyadic cube) satisfies the 
condition 

(7.3) 2~ d X(W) < X{W\E) < (1 -2~ d )X(W) 
if and only if it satisfies 

(7.4) 2- d X(W) < X(W n E) < (1 - 2~ d )X(W) . 

Any cube W satisfying these conditions of course also satisfies (17. Thus the only case 
which we have to consider is when (17. 3p and ( I7.4p are not satisfied for any dyadic subcube 
W of Q. Then, in particular, Q itself must satisfy either 

(7.5) A(Q n E) < 2~ d X{Q) 

or X(Q fl E) > (1 — 2~ d )X(Q). We can and will suppose that the former of these two 
conditions holds. If the latter condition holds then the proof is exactly the same, except 
for an interchange of the roles of the sets E and F where F = Q \ E. 

Let x be a density point of E in the interior of Q. By an appropriate version of the 
Lebesgue differentiation theorem, (see e.g. [2S] Theorem 7.16 pp. 108-109) there exists a 
dyadic cube U such that x G U C Q and 

(7.6) X{UnE)>{l-2~ d )X{U). 

Consider the finite sequence of dyadic cubes {U n }^ =1 for which U\ = U and Um = Q and 
which satisfy U n C U n+ i and X(U n+ i) = 2 d X(U n ) for each n = 1,2, ...,N — 1. In view of 
(17. 5p and (17. 6p the inequality 

(7.7) X(U n nE) > (l-2- d )X(U n ) 
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holds for n — 1 but not for n = N. So there must exist some integer n with 1 < n < N — 1 
for which (17. 7p holds and also 

(7.8) X(U n+1 n E) < (1 - 2~ d )X{U n+1 ) . 

Since we have excluded the possibility that U n+ \ satisfies (I7.4p . it follows from ( 17. 8p that 
X(U n+1 n E) < 2- d \{U n+ i), or, equivalently, that X(U n+1 \ E) > (1 - 2^ d )A(f/ n+ i). Using 
(17. 7p . we see that 

X(U n+1 HE)> X{U n HE)> 2~ d (l - 2- d )X(U n+1 ) 
and so the dyadic cube W = U n+ \ satisfies ( 17. 2p . □ 

Remark 7.3. We have not bothered to check whether it is possible to obtain a stronger 
conclusion in Lemma 17.21 where the constant 2~ d (l — 2~ d ) in (17. 2p is replaced by some 
larger constant. However any such improvement would not be very significant, since the 
simple example where E is a dyadic subcube of Q shows that the constant in (17. 2p cannot 
exceed 2~ d . 

Referring back to the terminology introduced in (1277]) . (T2~8|) . (|2~9|) . (12710]) and (127TT]) we 
can see that Lemma 17.11 gives us information about the collection Q(D) of subcubes of 
D, and Lemma 17.21 gives us analogous information about the collection T>(D) of dyadic 
subcubes of D. The following notion will put the results of these two lemmata in a more 
general context. 

Definition 7.4. Let £ be a collection of admissible subsets of M. d . We say that a number 
S is a bi-density constant for £ if, for each Q G £ and for each measurable set E for which 

< X(Q fl E) < X(Q), there exists some set W G £ with W C Q such that 

min {A(W n E), X(W \E)}> 5X(W) . 

Thus Lemma 17.11 tells us that 5 = 1/2 is a bi-density constant for Q(D). The next 
lemma will show (when we substitute s = 1/2) that this is also the case for the collections 
K.{D) and W(D) (defined above in ([2710]) and (I27TT]) ). The value 1/2 is in fact optimal 
since, clearly, any bi-density constant for any collection £ always has to satisfy 8 < 1/2. 
Lemma [7.21 and Remark 17.31 tell us that every bi-density constant S for T>(D) must satisfy 
2~ d (l - 2~ d ) < 5 < 2~ d . 

Lemma 7.5. Let K be a convex subset of M. d with non empty interior and let E be a 
measurable subset of K such that < X(E) < X(K). Then, given an arbitrary number 
s G (0, 1), there exists a cube W C K for which 

(79) \(WnE) ^ 

1 ' \(W) 

Proof. Since the boundary of a convex subset of M. d always has measure zero (see Appendix 
lll.4p . we may assume without loss of generality that E is contained in the interior K° of 
K. 

By the Lebesgue differentiation theorem, almost every point of the set E is a point of 
density. So there exists a subcube W a of K centred at one of these density points for which 

XEdX > s . 



X(W a ) 
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Similarly there exists another subcube Wb of K centred at some density point of F :— K°\E 
for which 



Now we define a family {W(t)} te r 11 of subcubes of K such that W(0) = W a and W(l) = W b 
and such that W(t) varies continuously as a function of t on [0, 1]. It is easy to see how to 
do this, but we will nevertheless give an explicit recipe. 

Let c(a) and c(b) be the centres of W a and Wb respectively, and let r(a) and r(b) be the 
side lengths of W a and Wb respectively. For convenience, and without of loss of generality, 
we may suppose that r(a) < r(b). 

If r(a) = r(b) then we simply define W(t) to be the cube of sidelength r(a) centred at 
the point (1 — t)c(a) + tc(b) for each t G [0, 1]. 

If r(a) < r(b) then we first define W(t) on [0, 1/2] by letting it be the cube of sidelength 
r(a) centred at the point (1 - 2t)c(a) + 2te(6). Then, for t G [1/2, 1] we let W(t) be the 
cube centred at c(b) with sidelength r(a)(2 — 2t) + r(b)(2t — 1). 

In both of the above cases, the fact that K is convex guarantees that W(t) C K for all 



fore it must take the value s at some point t* G (0, 1). The cube W — W (t*) satisfies (17 .9|) . 
□ 

7.2. Some preparations for a "balancing act" between three subsets of a cube. 

In the previous subsection we considered cubes Q which are the unions of two disjoint sets 
E and Q\E which both have positive measure, and we have obtained a subcube W of Q 
whose intersections with E and with Q\E are both "significant" proportions of W. Our 
main goal in the sequel will be to "upgrade" these kinds of results to a situation where the 
cube Q (or a more general set) is the union of three disjoint sets, which we may denote by 
E + and EL and G. We will show (in Theorem I7.8P that, under certain conditions, there 
is a kind of "tri-density constant". Let us try to express this a little more explicitly: If G 
is a "relatively small" part of Q then we will show that there is a subcube W of Q whose 
intersections with E + and EL are "significant" proportions of W . We will formulate this 
result in a more general context where Q and W are not necessarily cubes, but are members 
of some suitable collection £ of admissible sets. In Section [8] we will see the implications of 
this property of three subsets for the study of various versions of BMO. 

In order to express our main result for a more general choice of collections S of admissible 
sets, we need to define some more notions. Our point of departure for doing this comes 
from considering two important examples: 

(i) Every cube Q in R d is of course the union of 2 non overlapping subcubes, each having 
volume 2~ d X(Q). Then each of these subcubes can of course in turn be subdivided into 
2 d non overlapping subcubes of volume 2~ 2d \(Q), .... and this process can be continued 
indefinitely. 

(ii) Every special rectangle Q in R d is of course the union of 2 non overlapping special 
rectangles, each having volume 2~ 1 X(Q). Then each of these special rectangles can of course 
in turn subdivided into 2 non overlapping special rectangles of volume 2~ 2d X(Q), .... and 
this process too can be continued indefinitely. 




te [o,i]. 

It is easy to see that t H- 




XEdX is a continuous function of t on [0, 1]. There 
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Here is a notion which incorporates these two examples, and ultimately other examples. 

Definition 7.6. Let E be an admissible set in M. d . Let M > 2 be a positive integer. We 
will say that the doubly indexed sequence {Ej^} >0 1<k<M j of admissible sets is a multilevel 
decomposition of E with multiplicity M if it satisfies the following conditions: 

(i) Eq i = E, and, more generally, 

E = U E i< k 

k=X 

for each fixed j > . 

(ii) For each fixed j > 1 the sets Ej^ satisfy A {Ejj, n E^y) = whenever k ^ k'. 
(hi) A(S 7 - jfc ) = M~ j X(E) for each j > and k G {1, 2, „..j M- 7 } . 

(iv) For each fixed j > 1, and G {1,2,..., M J }, the set is the union of M sets from 
among the M J+1 sets Ej + i m . More explicitly, 

= U E 'j+l,rn ■ 

m=M{k-X)+X 

(v) The diameters of the sets Ej^ tend to zero uniformly as j tends to infinity, i.e., 

(7.10) lim I max dianxE, I =0. 

j^oo \X<k<Mi ' / 

The preceding definition leads us immediately to this next one. 

Definition 7.7. Let S be a collection of admissible subsets of M. d and let M > 2 be an 
integer. We will say that S is M -multidecomposable if every set E G £ has a multilevel 
decomposition {-By,fe} J>0 i<fc<Ap °f multiplicity M where all of the sets Ej^ are also in S. 

So, of course, for any open subset D of M d , the collections Q(D) and V(D) are both 
2 d -multidecomposable, and the collection W(D) is 2-multidecomposable. It is probably 
easy to show that the collection JC(D) is also 2-decomposable. 

7.3. Our main "geometrical" result. The promised "balancing act" between three 
subsets of a (generalized) cube. 

Theorem 7.8. Let £ be a M -multidecomposable collection of admissible subsets ofM d for 
some M > 2. Let r be a positive number. Let S be a bi-density constant for 8. 

Suppose that Q is a set in S and there exist three pairwise disjoint measurable sets E + , 
E- and G which satisfy 

Q = E + U E_U G 

and 

(7.11) min{A(£ + ),A(£_)} > r\(G) . 
Then there exists a subset W of Q such that W G S and 

(7.12) min{A(£+ n W),X{E_ n W)} > sX(W) 
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where 

f ( _ ^2 ^ 

< T < y/2-1 





f • i 

mm < 


M(1 + t)'*J 


s = < 


min < 


\3-2V2 \ 

l ^ '7 



y/2-1 < 



T . 



It will be convenient to explicitly state some immediate consequences of Theorem 17.81 for 
some special choices of Q and £ , 

Corollary 7.9. Suppose that Q is, respectively (i), a cube, or (ii) a dyadic cube or (Hi) a 
special rectangle in M. d and that, respectively, 

(i) £ = Q(Q), or (ii) £ = T>(Q), or (Hi) £ = W(Q). Suppose that Q is the disjoint union 
of the three sets E + and G and that 



min{A(£ + ),A(£_)} > (y/2 - l) X(G) 



Then there exists a set W C Q which is, respectively, (i) a cube, or (ii) a dyadic cube or 
(Hi) a special rectangle, and which satisfies 

min {X(E + n W), A(£_ n W)} > s\(W) 

where, respectively, (i) s = 2~ d (3 — 2y/2) , or (ii) s = 2~ d (3 — 2y / 2) (again), or (Hi) 
s = (3-2^/2. 

Proof of Corollary \7.9i . We simply apply Theorem 17.81 substituting the known values for S 

and M in the formula s = min j 3 ~^/^ , s\ in each of the three cases. □ 

Theorem 17.81 and Corollary 1 7. ,91 motivate us to introduce another notion. This notion 
will enable a convenient formulation of the main question raised by this paper, and also 
a convenient proof of the consequences that an affirmative answer to that question would 
have. 

Definition 7.10. Let £ be a collection of admissible subsets of M. d . Let r and s be positive 
numbers with the following property: 

Let Q be a set in £ and let E + , E_ and G be arbitrary pairwise disjoint admissible sets 
whose union is Q. Suppose that 

(7.13) min {X(E + ), X(E_)} > tX(G) . 
Then there exists a set W C Q which is also in £ and for which 

(7.14) min {X(E + n W), A(£L nW)}> sX(W) . 
Then we will say that (r, s) is a John-Strdmberg pair for £. 

Remark 7.11. The preceding definition is formulated for all possible positive values of r and 
s. However, for our particular applications we are interested only in cases where r < 1/2. 
This is why, in the formulation of Question A, we apply this latter restriction to r . 
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Example 7.12. We can reformulate Corollary 17.91 as follows: We take r = \/2 — 1. Then, 
for any open set D of M. d , we have that 

(i) (V2 - 1, 2^ d (3 - 2y/2)) is a John-Stromberg pair for Q(D) and also for V{D). 

(ii) (^/2 - 1, 3 ~2^ J is a John-Stromberg pair for W(D). 

Remark 7.13. In the special case where £ = Q{D) for some open subset D of R d then, 
obviously, (r, s) is a John-Stromberg pair if and only if the condition appearing in the 
second paragraph of Definition 17. 101 holds for just one particular cube Q in M. d , for example 
for the unit cube Q = [0, l] d . The cube Q does not have to be contained in D. (In fact the 
particular choice of D is irrelevant in this case, (r, s) is a John-Stromberg pair for Q{D) if 
and only if it is a John-Stromberg pair for Q(M. d ).) 

Remark 7.14. We can now express Question A concisely in the language of Definition 17. 101 
Question A simply asks whether there exist two absolute constants s > and r G (0, 1/2) 
such that (r, s) is a John-Stromberg pair for for every d G N. 

Remark 7.15. In Definition 17.101 the set G may be chosen to be empty. Therefore any 
number s, which happens to form a John-Stromberg pair (r, s) for £ with some positive 
number r, will also automatically be a bi-density constant for £. The particular value of r 
is immaterial here. 

Remark 7.16. Although the only restriction that we explicitly impose on s is that it has to 
be positive, we see from (17. 14p that some cube W, which has positive measure, must satisfy 

\{W) > X(E + f]W) + A(£_ n W) > 2 min {X(E + n W), A(£_ C\W)} > 2s\(W) . 

Thus the constant s in the above definition can never be greater than 1/2. 

Remark 7.17. The first of the two inequalities which appear in Definition 17. 101 (and also in 
Question A) is strict and the second is not. But would it really change anything if both or 
neither of them were strict? It would seem that not. We refer to Remark ll0.3l in connection 
with this issue. 

Proof of Theorem \7.8l In view of the regularity of A there exist compact subsets H + and 
H_ of E + and E_ respectively such that 

min{A(i/ + ),A(#_)} > t\(G) . 

If there exists W G £ such that W C Q and 

min {X(H + n W), A(#_ D W)} > sX(W) 

then obviously W also satisfies (I7.12p . This means that we may assume without loss of 
generality that E + and EL are themselves compact sets. Since they are also disjoint, it 
follows that 

p := dist {E + ,E^) > 0. 
We will say that the set W is a good set if W G £ and W C Q and 

min {X(E + n W), X(E_ n W)} > rX(G n W) . 



THE JOHN-NIRENBERG AND JOHN-STROMBERG THEOREMS FOR BMO 



31 



(We mention that a slight variant of this definition will play a role later, in Subsection 
110. 2\ in particular in the proof of Theorem 110. 6\ which will have some similarities with 
some of the arguments here.) 

Let {Qj,k} >o i<k<AP ^ e a multilevel decomposition of Q of multiplicity M where all 
of the sets Qj^ are in £. A sequence {Q } 0<n< g °f se ^ s in where £ can be finite or 
infinite, will be called a chain if Q(°> = Q and, if for each n such that 1 < n < £ we have 
Q(") = Q n kn for some integer k n G {1,2, ...,M n }. When £ = oo, we have (by (fTTUjl ) that 
limn-s.oo diamQ ( - n - ) = 0. So there must exist some ni such that diamQ^ < p for all n> n±. 
Thus, for n>n\ the set cannot intersect with both of E + and EL and so cannot be a 
good set. 

Let construct a particular chain {Q^} 0<n< £ m the following way. We of course have to 
start with Q<°) = Q. If among all the sets Qi,fc for k G {1,2, ...,M} there is no set which 
is good, then we set £ = 1 and our construction is complete. Otherwise we choose to 
be a good set from the above list. Next we check whether, among those of the sets Qi,k for 
k G {1,2,..., M 2 } which are contained in Q^ 1 ' , there is one which is a good set. If so we 
choose to be such a set. If not, we set £ = 2 and our construction is complete. The 
continuation of this process is now clear. At the nth stage we seek a good set, which we 
will call Q^ n \ from among those of the sets Q n ^ which are contained in Q < - n_1 ' ) . If no such 
set exists, then we set £ = n and the construction is complete. In view of the arguments 
presented in the previous paragraph, we must have £ < oo, i.e., the construction necessarily 
has to terminate after finitely many steps. 

Thus we have obtained a good set Q^~ x > which is one of the sets Qe-i^k for some integer 
k and it will enable us to complete the proof of the theorem. Let us denote the M sets 
of the form Qe jm which are contained in by Wi, W 2 ,...., W M . By our construction, 

none of these sets are good sets. For the rest of the proof we will assume that 

If the reverse inequality holds then we will simply use an exact analogue of the proof that 
we are about to give, where we will simply interchange the roles of E + and E_. 
We have to consider three cases: 

Case (i). Suppose that A(Q^" 1) D G) = 0. Then, since 5 is a bi- density constant for £ , 
and since A(Q (£ " 1) HE + ) and \(Q { ^ 1) \E+) = A(Q ( ^ 1) n£l) are both positive, there exists 
W G £ such that W C Q ( ^ 1} and 

(7.15) min {\{W n E+), \{W nfi_)}> 5X(W) > s\(W) , 

completing the proof of the theorem. 

Case (ii). Suppose that \{W m n EJ) = \{W m ) for some m G {1, 2, M} . 
Then 

\(Q^DE + ) > X(Q^nE^>X(W m nE^ = X(W m ) = M- 1 X(Q^) 

So we see that in this case the set W = Q^ 1 ^ has the properties required to complete 
the proof of the theorem. 
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Case (iii). This is the remaining case where cases (i) and (ii) are excluded. I.e., we have 
\(Q {£ -V D G) > and 

(7.16) \{W m n EJ) < \{W m ) for all m G {1, 2, M} . 
The inequality 

A(Q ( ^ 1} n EJ) > t\(Q.v-v n G) > 

(which holds because Q^ -1 ** is good) can be rewritten (in view of condition (ii) of Definition 
USD as 

Af M 

(7.17) £ A(W m n EJ) > rA ( W m n G ) > • 

m=l m=l 

Let iV be the (possibly empty) set of all integers m in {1,2,..., M} which satisfy 

x(w m nG) = o. 

If X(W m nE-) > for some m G iV, then, since (I7.16P also holds, we can again, analogously 
to what was done in Case (i), invoke the bi-density condition to obtain some W G S with 
W C W m which satisfies (I7.15P and so completes the proof. This means that we can now 
assume that X(W m (1 EJ) = for all m G N. Therefore (I7.17P can be rewritten as 

£ \(w m nE_)> Yl r\(W m nG)>0. 

me{l,2,..,AJ}\/V me{l,2,..,M}\N 

It follows that there exists at least one m G {1,2,..., M} \ N which satisfies 

\(w m n EJ) > r\(w m nG)>o. 

Recall that, by our construction, W m is not good. Therefore we must have 

X(W m nE + ) <r\(W m nG). 
We use this and the preceding inequality to obtain that 

x(w m nEj) > rX(w m n G) = r [X(w m ) - x(w m n e_) - x(w m n e+)] 

> T [X(W m )-X(W m nE_)-rX(W m nG)} 

> r[X(W m )-X(W m nE_)-rX(W m )} 
= (r-T 2 )X(W m )-rX(W m nE.). 

This implies that 

A (W m n E.) > T —^X{W m ) = M- lT —^X{Q^) . 
1 + r 1 + r 

Let us consider the case where r is in the range < r < v/2 - 1. bmce 

A(Q ( ^ 1} n E + ) > X{Q^ 1] n £_) > A(W m n EJ) 

and M" 1 ^ = s for this range of values of r, we see that in this case W = ^ satisfies 
(j7.12p . In the remaining case, where r > \/2 — 1, we simply observe that the given condition 
(17. lip also holds when r is replaced by the smaller number y/2 — 1 and so we can apply the 
same argument for this smaller value of r to obtain the required conclusion. This completes 
the proof of the theorem. □ 
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8. Applying our "geometrical" result. The non increasing rearrangement 

of a bmo function. 

Suppose that a function / of d- variables is in BMO (D, Q{D)) for some cube D in W 1 . It 
is known [3], [7] that this implies that the function of one variable /*, i.e., the non increasing 
rearrangement of /, is in BMO (J, Q{I)) and that 

(8-1) ||/* II BMO(I,Q(I)) — C\\f\\ B MO(D,Q(D)) 

for some constant C depending only on d. (Of course here we are in fact considering only 
the restriction of /* to the interval (0, X(D)).) Apparently the optimal value of C for which 
( 18.1 p holds for all such / is not yet known. Nor is it known yet whether or not C can 
be chosen to in fact be independent of d. It is known [T3] that C = 1 when d — 1. An 
analogous result holds for dyadic intervals |13| . 

In this section we wish to obtain inequalities analogous to ( 18. ip . in terms of the functional 
of John and Stromberg, namely inequalities of the form 

(8-2) 11/ WbMO(I,Q{I)) — C \\f\\BMO(D,S) 

where, as above, / is the interval / = (0, A(-D)) and s and a are suitably chosen numbers 
in (0, 1/2]. We will be particularly interested in the cases where D is a cube or a special 
rectangle and £ is Q(D) or W(D). But our results for these will be consequences of an 
analogous result for more general choices of D and £ . 

The constant C in our versions of (18. 2p will be C = 1 and this is apparently the best 
possible constant for any and every choice of the parameters a and s in (0, 1/2]. We will 
be able to take our parameter a to be any number satisfying 

1 2^2 - 2 

- > a > -Z- w 0.453082 . 

2~ 2^2-1 

(As mentioned earlier, values of a > 1/2 are not particularly interesting, since ||/*||smo(j Q(-O) 
can equal also when / is not a constant function.) Our parameter s will depend on our 
choice of £. In fact it will be exactly the parameter given by the formula s = min 1 3 ~%/^ , s\ 

in Theorem 17.81 Thus, exactly as in Corollary 17. 9[ we will have s = 2~ d (3 — 2y/2) if £ is 
either Q(D) or T>(D) and D is, respectively a cube, or a dyadic cube, and, furthermore, we 
will have s = (3 - 2y/2) /2 if £ = W{D) and D is a special rectangle. 

We should now stress that we will not obtain our versions of the inequality (18.21) . which 
we have spent the last few paragraphs describing, in as much generality as the reader 
may have been led to expect. We will only obtain them for the very special class of those 
measurable functions / on D which take only non negative integer values, and which satisfy 
H/Hbmo(D£) — 1/2- But, in view of the results of Section^ notably Theorem 16.51 this will 
be sufficient for our subsequent purposes. 

It is perhaps surprising, and perhaps even amusing, that, at a certain stage, it will 
turn out to be sufficient to consider an even more restricted subclass of the very special 
class of functions just referred to, namely those functions which only assume at most three 
consecutive integer values, which may just as well be 0, 1 and 2. 
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All the results which we have just described are immediate consequences of the following 
theorem (Theorem 18. 2p . In turn Theorem 18.21 will follow from a more general theorem 
(Theorem 18 .3p which will be formulated after this one. 

Remark 8.1. In the formulation of both of the following two theorems we consider a collec- 
tion £ of admissible sets and a particular set Q G £. Instead of £, we use the subcollection 
£{Q) of all sets in £ which are contained in Q. We are essentially forced to do this because 
our function / is defined only on Q. Obviously, if / happens to be defined on all sets in £ 
the conclusions of both theorems will remain true if we replace £(Q) by the larger collection 
£. 

Theorem 8.2. Let £ be an M -multidecompo sable collection for some M > 2. Let 5 be a 
bi-density constant for £. Let Q be a set in £ and let £(Q) be the collection of all sets in 
£ which are contained in Q. Suppose that the function f : Q — > N U {0} is measurable and 

satisfies \\f\\BMO(QS(Q)) — 1/2 f or s — mm 3 ~l/^ j ■ Let a be a number in the range 

, , 2^-2 1 

8.3 — -= < a < - . 

K J 2V2-1 ~2 

Then the function f* : (0,X(Q)) — > [0,oo), i.e., the non increasing rearrangement of f 
restricted to the interval I := (0, X(Q)) satisfies 

II f*l|(J.°0 < 11 f ||(J,s) 

WJ \\BMO(I,Q(I)) — WJ \\BMO(Q,£(Q)) ' 

We want to obtain this theorem as a consequence of the following somewhat more abstract 
and general theorem, which is formulated in terms of John-Stromberg pairs (r, s) for the 
collection £. By introducing this extra level of abstraction we also make it possible to 
formulate the consequences of an affirmative answer to Question A in a (hopefully) clearer 
and more organized way. 

Theorem 8.3. Let £ be a collection of admissible subsets ofR d . Let Q be a set in £ and 
let £(Q) be the collection of all sets in £ which are contained in Q. Let r G (0, 1/2) and 
s G (0, 1/2) be such that (r, s) is a John-Stromberg pair for £. Suppose that the function 
f : Q — > N U {0} is measurable and satisfies ||/||bmo(Q£(Q)) — • Let a be a number in 
the range 

, . 2r 1 

8.4 < a < - . 

K J 1 + 2r - 2 

Then the function f* : (0, X(Q)) — > [0,oo), i.e., the non increasing rearrangement of f 
restricted to the interval I := (0, X(Q)), satisfies 

(8-5) H/*llsAfO(/,S(/)) - W^WbMO{Q,£{Q)) ■ 

Remark 8.4. We find it interesting that the "geometric" condition of the existence of a 
John-Stromberg pair turns out to be in some sense "almost equivalent" to the "analytic" 
condition expressed by the inequality (18. 5p . This is revealed by combining Theorem 18.31 
with an auxiliary result (Theorem 18.51) which we will defer to the end of this section, since 
it will not be needed for obtaining our main result in Section [9j Theorem 18. 51 will be a sort 
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of converse to Theorem 18.31 (The fact that we use £(Q) rather than £ in the previous two 
theorems makes their results more closely comparable with the result of Theorem 18.51 ) 

Proofs of Theorems Iff.jjl an dSJA 

Let us first show that Theorem 18.21 is a consequence of 18.31 Since the conclusions of both 
theorems are the same, this simply amounts to showing that the conditions imposed on £ 
and s and a in Theorem 18.21 suffice to guarantee that £ and s and a satisfy the hypotheses 
of Theorem 18.31 for some suitable choice of r. In fact we will choose r = \/2 — 1 so that 
TTlr = 2vl-i " ^° w ^ en ' m Theorem [23 we require a to satisfy (18. 3p . this ensures that 
a will be in the range specified in (18.41) . It remains only to check that (r, s) is a John- 
Stromberg pair for £ when r = \pl — 1 and s = min |<5, 3 jj^ j- But this is exactly what 

is stated by Theorem 17.81 for these choices of r and s . 

Thus we can now turn to the proof of Theorem 18.31 Since we only have to deal with sets 
of £ which are contained in Q we may suppose from here onwards that £ = £(Q). 

The fact that all values taken by / are in NU {0} readily implies that the same is true 
for all values of /*. To explain this more precisely, since X(Q) < oo, we can invoke (13. 4p to 
obtain that 

(8.6) A ({x G Q : f{x) =m}) = \{te (0, \{Q)) : f*(t) = m}\ for all m G N U {0} . 
Then the fact that 

oo 

J2X({xeQ:f(x)=m}) = X(Q) 

implies that the subset of (0, X(Q)) where /* takes non integer values has measure zero. 
Let us first dispose of three easier special cases where we can readily see that (18.51) holds. 
The first of these cases is when / takes only one value (on sets of positive measure). 

Then of course H/H^/^q g> — I|/*IIbmo(7 q(i)) = no matter how we choose a and s. So 
we obtain (I8.5p . 

The second case is when / takes only two values (on sets of positive measure). In this 
case / = a\A + ^Xq\a f° r some A C Q such that < A (A) < X(Q) and where b < a. 
Since (cf. Remark 17. 15p s is also a bi-density constant for £, there exists some set Wo G £ 
contained in Q such that 

min {X(W n A), X(W \A)}> sX(W ) . 

This means that the restriction of (fxw )* to the interval (0, X(Wq)) is given by the formula 
(fXwo)* = a X(o,r) + bx[r,\(w )) f° r some number r G [sA(W ), (1 — s)A(H / )] • For all choices 
of the number u which satisfy u G (0, sX(Wq)) we have u G (0, r) and 

u+(l-s)X(W ) G [r,X(W )). 

Consequently, (fxw )* ( u ) = a an d {fXw T ( u + (1 ~ S )X(W )) = b. This implies, by 
Proposition 14. 5[ that J(/, Wo, s) = for this particular set Wo- Proposition 14.51 also tells 

us that J(/, W, s) < ^ for all other sets W in £ , so we conclude that 11 ^ l|(J,s) - ^ 



2 iui an wunci " in j ™ uiiai; nJ \\BMO(Q,£) ~ 2 ■ 

Since 1/2 is a bi-density constant for Q(I) and a G (0,1/2] it follows that a is also a 
bi-density constant for Q(I). So we can show, by applying reasoning to /* and a, which 
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is exactly analogous to the reasoning just applied to / and s, that \\f*\\^MO(i Q(i)) = 
Thus we see that (18. 5p holds in this case also. 

The third and last of these easier cases is when ||/||^/o(Q s) < V^- We will deal with 
this case by showing that here / has to be a constant. The fact that / and /* are integer 
valued, together with the formula (14.111) . tells us that the two functionals \\f*\\~Bhio{i Q{i)) 

and II/II^mckq e) are eacn infhna of appropriate subsets of the set {(n — l)/2 : n G N} of 
non negative half-integers. Thus these functionals themselves can only take non negative 
half integer values. More explicitly, if ||/||S? 0(Q)g ) < 1/2, then ||/|| bmo(q,£) = °- If / is 
constant a.e., then so is /* and, as in the first case, we see, trivially that (18.51) holds. We 
shall now show that this is the only possibility. Suppose, on the contrary, that / is not a 
constant a.e. Then let b be the smallest non negative integer (there must exist at least two 
such integers) for which the set B = {x G Q : f(x) = b} has positive measure. Then the 
set 

A = {xeQ: f(x) > b} = {x G Q : f(x) > b + 1} 

must also have positive measure and we must have X(B) + A (A) = X(Q). 

Let us now define the functions tp : R — y M. and : K. — > K by ip(t) = max {t, b} and then 
(p{t) = min{6 + l,tp(t)}. Obviously ip and therefore also tp are both 1-Lipschitz functions 
and therefore, by Lemma [4.31 we have that 

(8-r) II^°/II&W)^II/IIbm ) o ( q,,) = o. 

We also have <p o f = b\B + (b + 1)xa almost everywhere. Therefore, the same calculation 
that we did in case (ii) for a function taking only two different values a.e., gives here that 

ll*of||< J "> _ \(b+l)-b\ _l 
u J \\BMO{Q,S) ~ 2 ~ 2' 

This contradicts (18. 7\i and shows that / indeed must be a constant, completing our treat- 
ment of this case. 

Having disposed of these three cases, we can, from this point onwards, assume that / 
takes three or more different values on sets of positive measure, and we can also assume 
that ||/|Ismo(q e) = 1/2- Let us suppose that (18.51) does not hold, i.e., that 

( 8 - 8 ) Wf*\\BMO(I,Q(I)) > l / 2 - 

We will complete the proof of Theorem 18. 3l by showing that ( 18. 8 \ leads to a contradiction. It 
follows from ( 18. 8 j) that there exists some subinterval I 1 = (a, b) of I such that J(/*, I', a) > 
1/2. This means that the number a = 1/2 is not in the "competition" for the infimum in 
the formula analogous to (14. 9 j) for J(/*, I', cr) . So, for every c G M , we have 

|0 G (a, b) : \f*(t) -c\< 1/2}| < (1 - a){b - a) . 

If k is any integer and if we choose c = k + 1/2, then, since / and /* only take integer 
values, we have that 

{t G (a,b) : \f*(t) -c\ < 1/2} = {t G (a, 6) : c - 1/2 < /*(*) < c + 1/2} 

= {te(a,b):f*(t)e{k,k + l}} . 
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So, in fact, 

(8.9) \{t G (a, b) : /*(*) G {k, k + 1}}| < (1 - a)(b - a) for each keZ. 

(It may of course happen that this set is empty for some or even most values of k.) 
Since 

(a,b)= \J{te(a,b):f(t)<k} 

fceN 

and since the set {t G (a, b) : f*(t) < 0} is empty, there exists a unique integer k G N such 
that 

\{te(a,b):f*(t)<k-l}\ < a -(b-a)<\{te{a,b)--P(t)<k}\ 

(8.10) 1 

= \{te(a,b):f*(t)<k-l}\ 

The interval (a,b) is the union of the three disjoint sets {t G (a,b) : f*(t) < k — 1}, 
{t G (a, 6) : f*(t) e{k-l, k}} and {t G (a, 6) : f*(t) > k + 1}. By (JOJ and (18~10|) . the 
measure of the union of the first two of these does not exceed (l — f ) (b — a). Therefore 
we conclude that 

(8.11) \{te(a,b)-nt)>k + l}\>^(b-a). 

This implies that there exists some integer m > k + 1 for which the set 

{te(a,b):f*(t)=m} 

has positive measure. Let us also show that the set {t G (a, b) : f*(t) — k — 1} has positive 
measure. Its measure satisfies 



10 e M) 
10 e M) 
10 e (a, 6) 



rw = *-i>i 

/*(*) < A: - 1}| - ]{t e (a, 6) : < A: - 2}] 
f*(t)<k}\-\{te(a,b):r(t)<k-l}\ 



which, by (I8.10p . is indeed positive. 

We now know that on the interval (a, b) the function /* assumes at least one value strictly 
larger than k and at least one value strictly less than k. Since / is non increasing, this 
means that 

(8.12) e (a, b) : /*(*) = fc} = {t G (0, A(Q)) : /* W = ^} • 

Now let us define the three sets 

£_ = {x G g : f(x) <k-l},G = {xeQ: f(x) = k} and E + = {x G Q : /(x) > fc + 1} . 

By properties of the non increasing rearrangement, or, more specifically, in view of (18. 6p . 
we obtain that the A measures of these three sets are equal, respectively, to 

10 e (0, A(Q)) : f* (t) < k - 1}| and \{t G (0, A(Q)) : /*(*) = k}\ 

and |0 G (0, A(Q)) : /*(*) > fc + 1}|. Consequently, using fl8~T2"|) and then (JB~2D, we see 
that X(G) < (1 - <r)(6 - a). Then (IHTOj) and flHTT]) give us that A(£L) > f (6 - a) and 
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X(E + ) > |(6 — a). Therefore we have 

(8.13) min {A(£_), X(E + )} > _^-A(C7) . 

2(1 - a) 

It is a routine matter to check that the condition (18.41) is equivalent to 

< 8J4) t< w^D A- 

Let us pause for a moment to point out that we have finally reached the only step of 
the proof which needs a non trivial "geometric" input, namely the fact that (r, s) is a 
John-Stromberg pair for £. 

To know that we have this fact in the particular case that appears in the formulation of 
Theorem l8.2l we need to apply our "geometric" Theorem 17.81 To know that we have this fact 
for other particular collections £, or with better values of the constant s, we would need an 
affirmative answer to Question A, or to some other question. In the present theorem we have 
simply "axiomatized the 'geometric' problem away" by invoking a convenient definition. 

Now let us resume our formal proof: The estimates (I8.14p and (I8.13P give us the inequality 
( I7.13P which appears in Definition 17.101 Since we have required that (r, s) is a John- 
Stromberg pair for £ , this guarantees that there exists a set W G £ for which W C Q 
and 

(8.15) min {X(E + n W), A(£L HW)} > sX(W) . 

Let us now, analogously to what we did in the easy case (iii) above, define the functions ip : 
R -> E and ip : R -> R by if>(t) = max {t, k - 1} and then <p(t) = min {k + 1, i>(t)}. Here 
again it is obvious that ip and therefore also if are both 1-Lipschitz functions. Therefore, 
again by Lemma [4.31 and by our hypotheses on /, we have that 

(8-16) \\V o f\\^ om) < \\f&o iQ , £) < V2 • 

Note also that the function tp o f takes precisely three values, namely k — 1, k and k + 1. 
Our final step will be to show that the set W which satisfies (I8.15P must also satisfy 

(8.17) J(^o/,W,s)>l. 

This will contradict (I8.16P and so show that the assumption ( 18. 8 j) must be false, and thus 
will suffice to complete the proof of Theorem i !8.3l 

To simplify the notation, let us set g — tp o f. As already remarked above, this non 
negative function takes only the three values k — 1, k, and k + 1. More precisely, when 
we restrict g to the cube W, it takes these three values, respectively, on the sets E_ D W, 
G H W and E + D W, whose union is W. 

Thus the restriction of (gxw)* to the interval (0, X(W)) is given by 

{k + 1 , < t < a 
k , a < t < b 
k-1 , b<t<X(W) 

where a = X(E + n W) , b - a = X(G n W) and X{W) -b = X(E. n W). 

Let I = [u, u + (1 — s)A(W / )] be an arbitrary closed interval of length (1 — s)X{W) which 
is contained in (0,X(W)). Since the left interval (0, a) has length not less than sX(W) we 
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conclude that the left endpoint of / must line in (0,a). Similarly, since the right interval 
[b, X(W)) also has length not less than s\(W), the right endpoint of must lie in [b, X(W)). 
We deduce that 

(9XwT (u) - (gxwT (u + (1 - s)A(Q)) = (k + 1) - (k - 1) = 2 . 

Therefore, by Proposition 14. 5[ we have that 3(g,W,s) = 1 which establishes (j8.17p . As 
already explained above, this suffices to complete the proof of Theorem 18.21 □ 

We conclude this section by stating and proving the auxiliary result alluded to above in 
Remark 18.4} which is a sort of converse to Theorem 18.31 but which will not be needed for 
other purposes here. Note that here the connection (18.41) between r and a of Theorem 18.31 
has to be replaced by the different connection (I8.18p . Also the collection S appearing in 
( 18. 5 P has to be replaced by the closely related collection S(Q). 

Theorem 8.5. Let S be a collection of admissible subsets ofM. d . Let s and a be two given 
numbers in (0, 1/2) and let r be any number in (0, 1) satisfying 

(8.18) < a < T . 

V ' 1 + 2r 

Suppose that, for each Q G S, the inequality 

,(J,er) . || ,|.(J,«) 



(8.19) urn 

BMO(I,Q(I)) — WJ \\BMO(Q,£(Q)) 

holds for the interval I = (0,X(Q)) and for every measurable function / : Q — > R which 
assumes only the three values 0, 1 and 2. Here S(Q) denotes the collection of all those sets 
in S which are contained in Q. 

Then (r, s) is a John-Strdmberg pair for £ . 

Proof. Let Q be an arbitrary set in 8. Suppose that Q is the disjoint union of measurable 
sets E + , £L and G. Suppose that 

(8.20) min {X(E+), X(E_)} > tX(G) . 

We have to show that there exists some W G S such that W C Q and 

(8.21) min {X(E + n W), X(E_ n W)} > sX(W) . 
We may also suppose, without loss of generality, that 

(8.22) A(£_)< X(E + ), 

since, if not, we can simply interchange the roles of E + and E_. Let / : Q — > [0, oo) be the 
measurable function / = 2\e^ + Xg- Then 

/* = 2 X(0,A(£_)) + X[A(E_),A(E-)+A(C?)) • 

In view of flj£22p , the interval J := (0,2A(E_) + X(G)) is contained in (0, X(Q)). In view 

of (JS2QJ, 

|/ | = 2A( J B_) + X(G) > (2r + l)A(G) 

and therefore X(G) < l-^ol • Since the inequality (18.181) implies that ^rn - < 1 — 2cr, we 
deduce that 

(8.23) X{G) < (1 - 2a) \I Q \ . 
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On the interval Iq, the function /* takes the value 1 on a subinterval Iq of length X(G) 
located centrally in J and it takes the values 2 and 0, respectively, on two intervals I_ and 
I + , both of length A (.EL), located respectively on the left and right sides of Iq. In view of 
(I8.23p . each of these two intervals has length greater than a \Iq\. Now let 

h = [u,u + (1 -a) |/ |] 

be an arbitrary closed interval of length (1 — er) |/ | contained in Iq. This means that 
< u < a | Jo I- The fact that u < ct\Iq\ ensures that the left endpoint of Iq must lie in the 
interior of J_. The fact that u > ensures that the right endpoint of I\ must lie in the 
interior of I + . It follows that 

r(n)-r(n + (l-a)|/ |) = 2 

for each such interval I\. Consequently, using (14. lip , we deduce that ||/*||bmo(/ QU)) = ^ 
The hypotheses of the current theorem include the assumption that our function / satisfies 
(I8.19p . Note also that the formula ( 14. lip and the fact that the ranges of / and /* are both 

contained in [0, 2] guarantee that neither H/HsAfoCQ £ (Q)) nor H^"*Hbmo(j Q(i)) can ^ e g rea t er 
than 1. We deduce that 

( 8 .24) imi( J > s ) 



\BMO(Q,£(Q)) 

We now use ( I8.24p to deduce the existence of a set W G E(Q) such that 
(8.25) 3(f,W,s) = l. 

(When calculating ||/||g^o(Q S(Q)) we ^ a ^ e ^ e supremum of a set of numbers which is a 
subset of {0, |, l}.) 

To complete the proof we have to show that W is a set with the required property ( 18.21j) . 
Let us suppose that this is not the case and show that this leads to a contradiction. Our 
supposition means that at least one of the two sets (E + U G) D W and (EL U G) D W must 
have A measure strictly greater than (1 — s)X(W). 

Let h : (0,X(W)) — > [0, oo) be the restriction to (0,X(W)) of the non increasing re- 
arrangement of fxw- Then the statement of the last sentence of the preceding paragraph 
means that there exists an subinterval 1$ of the interval (0, X(W)) which has length strictly 
greater than (1 — s)X(W) on which h takes only two values, and that these two values are 
either and 1 or 1 and 2. This enables us to find a closed interval [u,u+ (1 — s)A(VF)] 
contained in the interior of I for which 

h(u)-h(u+(l-s)X(W)) < 1 

and therefore J (/, W,s) < 1/2. This contradicts (I8.25P and therefore completes the proof 
of the theorem. □ 

9. Putting all the pieces together. 

Now at last we can combine our results from previous sections to obtain our versions of 
the John-Nirenberg and John-Stromberg inequalities. The following theorem does this. It 
also explicitly and immediately shows (keeping in mind Remark I7.14p the consequence of 
an affirmative answer to Question A, thus proving what we claimed at the very beginning 
of this paper. 
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The hypotheses imposed on £, Q, r and s in this theorem are exactly those which 
were imposed in Theorem 18.31 . But here the parameter a does not need to be explicitly 
mentioned. 

Theorem 9.1. Let £ be a collection of admissible subset ofM. d . Let Q be a set in £ which 
contains all other sets of £. Let r G (0,1/2) and s G (0,1/2) be such that (r,s) is a 
John-Strdmberg pair for £ . Then, for every constant r in the range 1 < r < l/2r, the 
inequalities 

alogr 



BMO(Q,£) . 



as log r 



(9.1) A ({x G Q : \f(x) -m\> a}) < max {r, 2y/r~) ■ X(Q) ■ exp 
and 

(9.2) A {{x G Q : \ f(x) -m\> a}) < max {r, 2^} • A(Q) ■ exp 

\BMO(Q,£) 

hold for every a > 0, every measurable f : Q — y 1R, and every median m of f on Q. 

Remark 9.2. The inequality ( 11.11) which appears at the very beginning of this paper is of 
course simply ( 19. 2ft with r = l/2r. This is in some sense the most "pertinent" value of r to 
choose since it gives the best control of the left hand side of (19. 2p when we consider large 
values of a 

Proof of Theorem \9.1[ Let us prepare some ingredients which will later enable us to apply 
Theorem 16.51 In our application the sets D and E which appear in the statement of 
Theorem 16.51 will both be taken to equal the set Q specified in the formulation here of 
Theorem 19.11 Let / : Q — > N U {0} be an arbitrary function in the class $ which is defined 
in Theorem 16.51 Let a be a number satisfying the condition (18. 4p which is imposed in 
Theorem 18.31 Then, since / and a and s satisfy the hypotheses of Theorem 18.31 we can 
deduce from Theorem 18.31 that 

(9.3) urii 

BMO(I,Q(I)) — \\f\\BMO(Q,e) ' 

where I is the interval (0,A(Q))- The fact that /* is right continuous and non increasing 
on / ensures that, for every a > 0, it satisfies the inequality corresponding to (15. 7\i which 
here takes the form 

(9.4) |0 G I : f*(t) - f* (A(Q)/2) > a}\ < ■ |/| ■ exp ' 



2a V 2 II /"*|| (j ' ct) 

As an element of $, the function / must satisfy 
(9.5) A({xGQ:/(x)>0})<^A(Q). 

This implies that f*(t) = for all t > A(Q)/2 and therefore also that /* (A(Q)/2) = 0. 
(Use properties (i) and (ii) of Section [3j) 
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Now, for every a > we can apply (13. 2h to the left hand side of (19.41) and use (19. 3p to 
bound the right hand side of (19 ,4p from above. This gives us that 

(9.6) A ({x G Q : f(x) > a}) < ■ X(Q) ■ exp ' alog (-~ 



2(7 \ 9llfll (J '' 

BMO(Q,£) , 

We deduce, using (19. 6p when a > 0, and using (19. 5p . together with the fact that > |, 
when a = 0, that 

A ({x G Q : f(x) >a})<^- X(Q) • exp ( 

for all a > 0. This last inequality corresponds to the inequality (I6.14|) of Theorem 16.5} and 
the fact that it holds for every / G $ is exactly what we need to justify applying Theorem 
16.51 Since here the constants C and c of (I6.14p are respectively 



2(7 2 
the formula (I6.15P furnished by Theorem 16.51 takes the form 

(9.7) 



A ({x G Q : \ f(x)-m\ > a}) < 2max<j ^— -, W- 1 ■ A(Q) - exp ' ^ a 



2(7 ' V o v 7 \ 8 II fll (J,s) 

) \ ° WJ UBMO(Q,£), 

Let us put r = - — 1 = ^ =£ . Since a satisfies (18.4J1 and since, as already observed during 
the proof of Theorem 18. 3| (18 .4p is equivalent to (I8.14p , it follows that r < l/2r < 1/2 and 
so 1 < r < 1/2t. Thus (19. 7p gives us (19. ip for all r G [1, l/2r), and therefore also, by 
continuity, at the endpoint r = l/2r. Finally (19. 2p follows immediately, in view of (14. 8p . 
□ 

Remark 9.3. Let us consider Theorem 19.11 in the particular case where Q is a special rec- 
tangle and S = W(Q). Then we know (cf. Corollary 17.91 or Example 17. 12p that we can take 
r = y/2 — 1 and s = (3 — 2y / 2)/2 ~ 0.0857864. So the parameter r can range between 1 
and 1/(2^2-2) w 1.20711. Consequently, the right hand side of (Q can be, for example, 

■ m • exp f_ M3-2V2)Ml/(2V2-2)) 

V 16 II/IUjv/o 

where H/H^zo ^ s semm orm defined in (I2.14p . This expression is approximately equal 

to 2.197 ■ X(Q) ■ exp ( — .. ° t ^ 2a ) . Wik obtains a smaller expression, approximately equal to 

V WjWbmoJ 

2X(Q) exp 



0.043o 



\BMO 



10. Towards an answer to Question A 

The reader who has already traversed all the previous sections of this paper to get to 
here is presumably convinced by now that it is worth trying to answer Question A. So let 
us spend this section trying to offer some help towards that goal. We will formulate three 
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new questions. The first and second of them seem to be somewhat easier to answer than 
Question A, and we shall see that they are each essentially equivalent to Question A. The 
third question is of a somewhat different nature, and not equivalent. But we shall indicate 
why we consider it also to be well worth considering. 

For each x G M. d and r > we recall the standard notation Q(x, r) for the set 



This is of course the closed cube in M. d centred at the point x and having side length 2r . 
We will need to use two simple properties of such cubes, which we state in the following 
lemma: 

Lemma 10.1. Let {x n } ngN be a sequence of points in M. d and let {r n } neN be a sequence of 
positive numbers which converge, respectively to the point x* G M. d and the positive number 
r*. 

(i) If all of the cubes Q(x n ,r n ) are contained in some closed set Q then Q(x*,r*) is also 
contained in Q. 

(ii) Let A be a fixed measurable subset ofM. d . Then 



The proof of this lemma is an easy exercise. In this preliminary version of our paper we 
include it as Appendix II 1 .71 



10.1. Reducing the description of John-Stromberg pairs to the case of sets which 
are finite unions of cubes. 

The following theorem shows that, in order to obtain an affirmative answer to Question A, 
or merely to determine any pairs (r, s) which are John-Stromberg pairs for the collection 
of all cubes in M. d , it suffices to consider only those sets E + and E_ which are of a compar- 
atively simple form. This also suggests that one might choose to reformulate Question A 
and the question of finding John-Stromberg pairs for cubes in W 1 in terms which are more 
in the realm of combinatorics, counting lattice points etc. 

Theorem 10.2. Suppose that the numbers r G (0,1/2) and s > have the following 
property: 

Whenever F + and F_ are disjoint subsets of the closed unit cube Q = [0, l] d in M. d such 
that 



and also each of the sets F + and F_ is the union of finitely many dyadic cubes, then there 
exists some cube W contained in Q for which 






lim X(A n Q(x n , r„)) = X(A n Q(x*,r*)) . 



min {X(F + ), A(F_)} > r\(Q \ F+ \ F_) , 



min {\(W n F+), \(W fl F_ ) } > s\(W) . 



Then (r, s) is a John-Stromberg pair for the collection of cubes in M. d . 
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Proof. We will use the notation "b" to stand for a subscript that is either "+" or "— ". More 
precisely, whenever we write a formula where b appears as a subscript in one or more places, 
this expresses the fact that the formula holds in both of the cases: 

(i) whenever that subscript is replaced throughout by the subscript +, and also 

(ii) whenever it is replaced throughout by the subscript — . 

We start by writing down two simple formulae which will be useful later: 
Suppose that V + and V- are two disjoint measurable subsets of Q and that U+ and U- 
are measurable subsets, respectively, of V + and of V— Then, obviously, 

(10.3) A(H) = A(t/b) + A (H \ U\>) 
and so, consequently, 

\(Q\U+\UJ) = X(Q) - X(U+) - A(IL) 

= A(Q) - X(V + ) - \(V-) + X(V + \ U + ) + \{V_ \ U.) 

(10.4) = \{Q\V+\V-) + \{V + \U+) + \{V-\U_). 

After this preparation, let us suppose that E + and E_ are disjoint arbitrary measurable 
subsets of the cube Q which satisfy 

min {\(E+), \(E-)} > rX{Q \E + \E_). 

In order to prove the theorem, we have to find a cube W contained in Q for which 

(10.5) min {X(W D E+), X(W flE_)}> sX(W) . 

The obvious fact that we need only consider the case where Q is the unit cube Q = [0, l] d 
was already pointed out in Remark !7.13I We can of course assume without loss of generality 
that E + and E_ are both contained in Q°, the interior of Q. 

Since Lebesgue measure is inner regular, there exist compact sets H + and H_ contained 
respectively in E + and E- such that X(E + \H + ) and A(£L \H_) are both sufficiently small 
to guarantee (of course via formulae like fl 1 . 3 [) and (110. 4p ) that 

min {X(H+), A(#_)} > rX{Q \ H + \ Hj) . 

We let Eq be a positive number which is chosen sufficiently small so that it satisfies 

(10.6) 2e + 4re < min {X(H + ), X(H_)} - tX(Q \H + \ HJ) . 

Let S = dist (H + , ff_).This is of course a positive number, since H + and H_ are disjoint 
and compact. We shall use 5 to obtain two disjoint open sets Q + and Q_ contained in Q°, 
such that H\) C and 

(10.7) dist(fi + ,fi_) > 

Initially we can choose to be the set Q\> = Q° fl [J xeH (x + B), where B is the open 
ball of radius 5/4 centred at the origin, and this indeed will guarantee that (110. 7ft holds. 
But then, furthermore, since Lebesgue measure is outer regular, we can, by replacing Q\, if 
necessary by its intersection with some other open set containing H\,, assume also that 

(10.8) A(ft b \# b )<£ 
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(10.9) X(H b )<X(n b )<X(H b )+e . 

Now we use the fact (see e.g. [26J Theorem (1.11) p. 8) that every open set Q in M. d is the 
union of some sequence {D n } n&N of non overlapping closed dyadic cubes. In particular we 
shall write Q b = {J neN D bn ^ or both choices of b. Obviously all of the dyadic cubes D bn 
have to be contained in Q. 

Let N be a positive integer which is sufficiently large to ensure that 

oo 

(10.10) Yl *(A,n)<eo 

n=N+l 

for both choices of b. Of course there exists such an N since 

^A(A,n) = A(ft b )<A(Q )<oo. 



n=l 



Then let F b be the compact set F b = {J^=i A,n- 

We want to be able to apply the hypotheses of the theorem to the two disjoint sets F + 
and F_. So we need estimates from below for the measures of each of these sets. But first 
we note that F b C fl b and, by (110. 10p . 



X ( |J A,n) 

\n>AT / 



(10.11) X({l b \F b ) 
It follows (cf. (P3E3D ) that 

X(F b )<X(n,)<X(F b )+e . 
This last estimate, together with (110. 9p . gives us that 

(10.12) \X(F b ) - X(H b )\ <2e . 

Now we substitute V b = Q b and U b = H b in (110. 4p and then apply (110. 8p to obtain that 

\X(Q \ H + \ H_) - X(Q \n + \n_)\< 2e . 

Then, analogously, we substitute V b = Q b and U b = F b in (j!0.4p and then apply (110. lip to 
obtain that 

\X(Q \ F + \ F_) - X(Q \ Q + \ < 2e . 
The preceding two inequalities imply that 

(10.13) \X(Q \ F + \ F_) - A(Q \H + \HJ)\< Ae . 

We can now deduce our required estimates from below for X(F b ). In the following cal- 
culation we shall use (110. 12[) in the first line, and then (110.61) in the third line, and then 
(110. 13p in the fourth line. 
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A(F b ) > \(H b )-2e 

> min{A(F + ),A(if_)}-2£ 

> tX(Q\H + \H^ + (2e + 4te )-2e 

> t\(Q\F + \F_) + (2e + Are ) - 2e - Ate 
= rX(Q\F + \F.). 

Since the sets F + and F_ are disjoint and are each finite unions of dyadic intervals, the 
estimate which we have just obtained for X(F + ) and A(F_) is exactly the one which we 
require to apply the hypothesis of our theorem. That hypothesis implies that there exists 
a cube, which we will denote by W(eq) contained in Q, for which 

(10.14) min {X(W(e ) n F+), X(W(s ) n F_)} > sA(W(e )) > . 



We deduce from (jl0.7j) that dist(F + ,F_) > |. This in turn implies that the cube W(e$) 
must satisfy 

(10.15) diamW(e ) > - 

since it contains points of F + and also points of F_. 
It follows from f lTMl) that 

\((W(e )n^)\(W(e )nH b )) <s 

and, from fllO.llj) . that 

X((W(e )nQ b )\(W(e )nF b ))<e . 
Therefore, for both choices of b, 

\X(W(eo)nF £ )-X(W(e )nH b )\ 

< \X(W(e ) n F £ ) - X(W(e ) n fib) I + |A(W(e„) n fi e ) - A(W(e ) D fT b )| 

< 2e . 

Combining this inequality with (110. 14p gives us that 



(10.16) mm{X(W(e ) D H + ), X(W(e ) D H^)} > sX(W(e )) - 2e . 

Note that we fixed the sets H + and H_ and therefore also the number 5, before we chose 
the number Eq. This means that, for each number e satisfying < e < Eq, we can use 
exactly the same arguments as were used from ( 110.61) to ( 110. 16f) to obtain a subcube W (e) 
of Q which satisfies (110. 15ft and ( 110. 161) . but with e in place of Eq. In fact we shall do this 
for each E n in a sequence {e„} ngN of numbers in the interval (0, Eq] which converges to 0. 
Thus we obtain a sequence {W(E n )} n£N of subcubes of Q which, for each n, satisfy the 
counterparts of ( 110. 15[) and ( 110. 16[) . namely 

(10.17) diamW(E n ) > - 
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and 

(10.18) mm{\(W(e n )nH + ),\(W(e n )nH_)} > s\(W(e n )) - 2e n . 

For each n G N let r n > and x n G Q be the half side length and centre, respectively, 
of W(e n ). I.e., as in ( 110. ip . we have W(e n ) = Q(x n ,r n ). By passing, if necessary, to a 
subsequence, we can assume that the sequences {r n } ngN and {x n } ngN are both convergent, 
to limits r* > and x* G Q respectively. The condition (110. 17\i ensures that in fact r* > 0. 
Using part (i) of Lemma flO.lj we see that the cube W := Q(x*, r*) is contained in Q. Then, 
applying part (ii) of the same lemma with A chosen to be the set H\,, we obtain that 

lim X(W(e n ) n # b ) = X(W D H±) 

for both choices of the subscript b. Obviously we also have 

lim X(W(e n )) = lim r d = r d = X(W) . 

n—>oo n— >oo 

Thus a passage to the limit as n tends to oo in (jlU.lSp . shows that the cube W satisfies 
min {X(W (1 H + ), X(W n H_)} > sX(W) . Since H b C E\, this last inequality immediately 
implies that W satisfies (110. 5p . This, together with the fact that W C Q and the fact that 
the sets E + and E_ were chosen arbitrarily, suffices to show that (r, s) is a John-Stromberg 
pair for cubes in M. d . □ 

Remark 10.3. We have not bothered to write out the details, but it seems very likely 
that techniques similar to those in the proof of the preceding theorem might show that 
whenever (r, s) is a John-Stromberg pair for the collection of all cubes in M. d then it also 
has the following slightly stronger property: 

Whenever E + and E_ are disjoint admissible subsets of the cube Q in M. d such that 

(10.19) min{A(£+),A(£_)} > rX(Q \E + \ E-) , 
then there exists some cube W contained in Q for which 

min{A(jy f]E + ),X(W HE^)} > sX(W) . 

In other words, it seems very likely that, even if equality holds in (110. 19[) . this is still enough 
to imply the existence of a cube W with the same properties as before. 

10.2. A reduction of Question A to a different special case. 

Here is a variant of Question A. We will call it Question A'. 

Do there exist two absolute constants t' G (0, 1/2) and s > which have the following 
property ? 

For every positive integer d and for every closed cube Q in M. d , whenever F + 
and F_ are two disjoint compact subsets of Q which each have positive mea- 
sure, which are each the union of finitely many closed rectangles, and whose 
d-dimensional Lebesgue measures satisfy 

(10.20) min{A(F + ),A(F_)} > r'X(Q \F + \ F_) , 
then there exists some cube W contained in Q for which 

(10.21) min {X(W n F+), X(W n F_)} > sX(W) . 
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Here the terminology closed rectangle means a subset of Mr which is the cartesian product 
of d bounded closed intervals. Our reason for using rectangles rather than cubes here will 
become apparent later (in Remark 110. 5p . Note that if we did not include the requirement 
that both of F + and _F_ have positive measure, then, in view of the possibility of taking 
F + = Q, the answer to the above question would be negative. 

It follows immediately from Theorem I10.2[ that an affirmative answer to Question A' for 
some constants r' and s would imply an affirmative answer to Question A for every choice 
of the constant r satisfying r G (r', 1/2) and for the same constant s. Thus we may transfer 
our attention from Question A to Question A'. By the same simple reasoning as in Remark 
I7.16| if Question A' has an affirmative answer, then the constant relevant constant s has to 
satisfy s < 1/2. 

We shall show that we can reduce Question A' to yet another question which is, in 
principle, easier to answer. However its formulation is more technical and requires the 
following three definitions. (The second of them is reminiscent of, but slightly different 
from a definition used in the course of the proof of Theorem 17.81 ) 

Definition 10.4. Let Q be a closed cube in M d and let F + and F_ be two disjoint compact 
subsets of Q which (as in Question A') are each unions of finitely many closed rectangles. 
Let t' be a number in (0, 1/2). Let V be a closed subcube of Q. 

(i) We will say that V is an exceptional subcube if A(V H F + ) and A(V D F_) are both 
strictly positive and A(V \F + \ F_) = 0. 

(ii) We will say that V is a good r' -subcube if A(V D F + ) and A(V H are both strictly 
positive and 

min{X(V nF + ),A(VnF„)} > t' \(V \ F + \ F_) > 0. 

(iii) We will say that V is r' -minimal if it is a good r'-subcube but every strictly smaller 
closed subcube of V is neither exceptional nor a good r'-subcube. 

Remark 10.5. Of course, even though our chosen terminologies do not explicitly express 
it, all three of the above notions depend crucially on the choice of the cube Q and of 
its subsets F + and F_. When occasionally necessary, we can replace these terminologies 
by the more explicit exceptional (Q, F + , F_)- subcube and good (r' ,Q, F + , F_)- subcube and 
(r' ,Q, F + , F_) -minimal subcube. In connection with this we will need the following three 
simple observations. We make them in the context where V, V* and Q are closed cubes 
which satisfy V C V* C Q, and F + and F_ are two disjoint compact subsets of Q which are 
unions of finitely many closed rectangles. Then of course V* D F + and K H F_ are disjoint 
compact subsets of K which are finite unions of closed rectangles. (We could not make an 
analogous claim if we considered finite unions of cubes instead of rectangles.) 
Our observations are that 

(i) V is an exceptional (Q, F + , F_)-subcube if and only if 
it is an exceptional (K, V* H F + , K D F„)-subcube, 

(ii) V is a good (r', Q, F+, F_)-subcube if and only if 
it is a good (r', V*, K H F + , V* D F_)-subcube, 

and so, in view of (i) and (ii), 
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(iii) V is (r', Q, F + , F_)-minimal if and only if 
it is good (r', K, V* H F + , K H F_)-minimal. 

When we try to obtain a positive answer for Question A' we of course have to start 
with a given cube Q and subsets F + and F_ of Q such that, in the language of Definition 
110.44 Q itself is a good r'-subcube. Our simplification, which we will state formally in a 
moment (as Theorem I10.6p . is that we only have to consider the special case where Q is 
also r'-minimal. Thus it would be of interest to study the properties of r'-minimal cubes. 
One simple and easily established property of such cubes will be given below in Lemma 
110.71 Later (in Subsection 1 10. 3[) the reader will be invited to consider whether r'-minimal 
cubes have a certain other simple but much less evident property. If they do, this would 
lead to an affirmative answer to Questions A' and A. 

Here then is the reduction of Question A' alluded to above. 

Theorem 10.6. Let r' and s be constants satisfying r' G (0, 1/2) and s G (0, 1/2]. Let Q 
be a closed cube in M. d . 

Suppose that, whenever F + and F_ are two disjoint compact subsets of Q which are each 
unions of finitely many closed rectangles, and are such that Q is a r'-minimal subcube (of 
itself), then there exists a cube W contained in Q such that 

(10.22) min {X(W D F+) t X(W n F_)} > s\(W) . 

Then Question A' has an affirmative answer for the same constants r' and s . 

It will be convenient to present some parts of the proof of this theorem separately in the 
following lemma. 

Lemma 10.7. Let Q, F + , F_ and r' be as in Definition \lU.^ , 

(i) If there exists an exceptional subcube of Q, then there exists a subcube W of Q which 
satisfies 

(10.23) X(W n F+) = X(W n F_) = min {X(W n F + ), X(W D = -X(W) . 

2 

(ii) IfV is a r'-minimal subcube of Q, then it satisfies 

(10.24) min{A(ynF + ),A(V r nF_)} = r'X(V \F + \F_) >0 

and every subcube WofV which is strictly smaller than V satisfies at least one of the two 
conditions 

(10.25) min {X(W (1 F + ), X(W n F_)} = 
and 

(10.26) min {X(W D F + ), X(W n < r'X(W \F + \F_). 

Proof. We first deal with part (i). Suppose that V is an exceptional subcube of Q. We 
want to use Lemma 17.11 The roles of the cube denoted by Q and of the subset E of Q in 
the statement of that lemma will now be played here, respectively, by the cube V and by 
its subset V D F + . To apply the lemma we need to know that X(V D F + ) > which is part 
of the definition of exceptional subcubes, and we also need to know that X(VC\F + ) < A(V). 
This second inequality follows from the given condition X(V (~) F_) > (again part of the 
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definition) and the inclusion V fl i 7 ! C V \ F + = V \ (V R F + ) which together give that 
< X(V fl F_) < X(V) — \(V fl F + ). Thus Lemma ITU can be applied to provide us with a 
subcube W of V for which 

(10.27) A (W \ {V n F + )) = X (w n (V n = 2 A ( M/ ) • 

Obviously 

(10.28) A(^n (vnF+)) = A(w^nF + ) , 

and, since V is essentially the union of V fl _F + and K fl F_, it will also be easy to deduce 
that 

(10.29) \(W\(V C\F + )) = \(W r\F_) . 
More explicitly, since F + fl F_ = and C V, we have 

w n f_ c w \ f+ c w \ (v n f+) c iy \ n f+) 

C W\F + = ((W\F + )\F_)U((W\F + )nF_) 
C (V\F + \F_) u (ifnf_) . 

These inclusions and then the fact that V is an exceptional subcube, imply that 

A(lfn^) < X(W\(V DF+)) 

< A(y \F+ + A(lfnF_) = + A(lfnF_) 

which establishes (110. 29p . The required formula (110. 23p now follows immediately from 
(TTa2Tj) . (rT028|) and (11029]) . 

We now deal with part (ii) of the lemma. Suppose that V is a r'-minimal subcube of 
Q. The fact that every strictly smaller subcube W of V satisfies at least one of the two 
conditions ( 110. 25p and ( 110. 26p is simply a rewriting of definitions. More explicitly, suppose 
that some such subcube W fails to satisfy both ( 110. 25p and ( 110. 26p . Then, in the case 
where t'X(W \ F + \ F_) = , this implies that W is an exceptional subcube. In the case 
where t'X(W \F + \F_) > this implies that W is a good r'-subcube. The given condition 
on V excludes both of these possibilities, so at least one of ( 110. 25p and (110. 26p must be 
satisfied. 

Finally, suppose that V does not satisfy ( 110. 24p . Then, since V is a good r'-subcube, it 
must satisfy 

(10.30) min {X(V fl F + ), X(V n F_)} > r'X(V \F + \F^)>0. 

Now let W be a subcube of V such that X(V \ W) < e. If e is sufficiently small, it follows 
from (irtOUj) (see Remark MM that 

(10.31) min{X(W f]F + ),X(W nF„)} > t'X(W\F + > 

which means that W is also a good r'-subcube, contradicting the r'-minimality of V. This 
shows that (I10.24P holds and so completes the proof of the lemma. □ 
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Remark 10.8. More explicitly, the inequalities which enable us to deduce f II . 3 1 f) from 
(110. 30p for sufficiently small e are, first of all 

X(Wf)F + ) < X(V DF + ) < X(W f]F + ) + X((V\W)nF + ) 
< X(Wf]F + )+e, 

then the counterpart of this where F + is replaced by _F_, and then, finally, 

X(W\F + \F„) < X(V\F + \F_) < X(V\W\F + \F„) + X(W\F + \F_) 
< s + X(W\F + \F^). 

The proof of Theorem \10.b\ Our approach here will have some features in common with 
the proof of Theorem 17.81 Let Q be an arbitrary cube in IR d and let F + and F_ be arbitrary 
disjoint subsets of Q which are both finite unions of bounded closed rectangles and satisfy 
(110. 20p . We have to show that there exists a subcube W of Q which satisfies (110. 21|) . In 
one case this is very easy to do, namely when Q has a subcube which is an exceptional 
cube. We simply invoke part (i) of Lemma 110.71 to obtain a cube W satisfying (110. 23p and 
therefore (110. 2ip . since s < 1/2. This leaves us free to assume, for the rest of this proof, 
that Q does not contain any exceptional subcubes. 

Let us now consider the collection Q of all subcubes of Q which are good r'-subcubes. 
This is non empty since Q itself is such a cube. Since F + and F_ are disjoint and compact, 
we have that 

p : = dist (F+,F_) > 0. 

Since every good cube must intersect with both F + and F_, it follows that the diameter of 
each V G Q satisfies p < diam V < diamQ . Consequently the infimum p* := infy e g diaml/ 
is strictly positive. Furthermore, there exists a sequence of cubes {^4,} ngN in Q such that 
lim^oo diam V n = p*. For each n, let V n = Q{x n ,r n ) (here again using the standard 
notation (110. ip ). By passing to a subsequence of {K 1 } ngN if necessary, we can assume that 
the sequences {x n } neN and {r„} ngN converge, respectively, to a point x* G Q, and to the 
positive number r* = p*/2\fd . We let K = Q(x*, r*). Since each V n is in Q we have 

(10.32) min {X(F + n Q(x n , r n )), A(F_ n Q(x n , r n )} > r'X (Q(x n , r n ) \ F + \ F_) > . 

We can pass to the limit in these inequalities, using three applications of part (ii) of Lemma 
110. H where we choose the set A to be, respectively, F + , F_ and M. d \ F + \ F_. This gives 

(10.33) min {X(F + n Q(x.,rJ), A(F_ n Q{x^ r*)} > r'X (Q(x*, r,) \ F + \ F_) > . 
In fact, we have 

(10.34) A(Q(x*,r*)\F + \F_) > 0. 

We will defer the proof of ( I10.34p for a moment. It follows from part (i) of Lemma fl 0.1 1 that 
K C Q. This, together with (110. 33p and (110. 34p . implies that K is a good r'-subcube. Since 
diamK = p* no strictly smaller subcube of V* can be a good r'-subcube. Furthermore, 
using the assumption that we showed that we could make above, no subcube of V* can 
be exceptional. This means that K is r'-minimal. Thus, in the terminology of Remark 
110. 5[ the cube K is also (r', V*, V* fl F + , V* fl F_)-minimal. Now we are ready to invoke the 
hypothesis of our theorem, but where here the role of the cube Q is now played by V* and 
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the roles of the sets F + and F_ are played by V* D F + and V* D F- . The hypothesis ensures 
that there is a subcube W of V*, and therefore also of Q which satisfies 

min{\(w nv*nF + ),\(wnv*nF_)} > s \(w). 

Since of course W (IV* = W and So > s the cube W satisfies (110. 2ip and is therefore the 
cube required to complete the proof of the theorem. 

It remains only to show that (|10.34|) . Suppose, on the contrary, that 

(10.35) \(Q(x.,r.)\F+\FJ) = 0. 

Since we have excluded the possibility that the cube V* = Q(x*,r*) is an exceptional 
subcube, this means that at least one of A(V* PI F+) and A(V* D F_) must equal 0. We can 
suppose that A(K D F_) = 0, since the other case, where A(K PI F + ) = 0, can be treated 
exactly analogously. This supposition, together with ( 110. 35p . implies that 

= A(K \ F + \ F_) = A(K \ F + ) - A ((K \ F+) n F_) = \(V* \ F+) 

and so A(K) = A(K \ F + ) + A(K D F + ) = A(K fl F + ). This means that every subset of 

V* with positive measure must contain points of F + . Consider the cube Q ^r*,r* + ^^j, 

where, as above, p = dist(F + , Every point in this cube is at distance strictly less than 
p from some points in V* and therefore also at distance strictly less than p from points in 
F+. Therefore, 



10.36) F.nQ 



P 



On the other hand, if n is chosen large enough, we obtain that 

P 
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2vV 

which implies that the cube Q(x n ,,r n ) is contained in Q ^x*,?"* + ^^J- In view of ( 110.31 

the cube Q(x n ,r n ) contains points of F_. This contradicts (110. 36ft . and thus shows that 
( 110. 35p cannot hold. This proves ( 110.341) and completes the proof of the theorem. □ 

10.3. Another question which should be considered. 

As already remarked in the previous subsection, in view of Theorem 110.61 it could be very 
helpful if we can discover some concrete consequences of the apparently very stringent con- 
dition on a cube that it is r'-minimal. More explicitly we invite the reader to consider the 
following question, which we shall call Question B. 

Let Q be a cube in R. d and let F + and F_ be disjoint compact subsets of Q 
which are each finite unions of closed rectangles. Suppose further that Q is 
r'-minimal for some t' E (0, 1/2). Does this imply that \(F+) = A(F_) ? 

This question attracts our attention for reasons expressed by the following two proposi- 
tions. 



Proposition 10.9. In the case where d = 1, the answer to Question B is affirmative for 
every choice of the constant r' G (0, 1/2). 
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Proposition 10.10. If Question B has an affirmative answer for arbitrary dimension d and 
for some constant r' G (0, 1/2) which does not depend on d , then this implies affirmative 
answers for Questions A and A' . 

In the light of these two propositions we have very strong motivation for attempting 
to answer Question B for the case d = 2. An affirmative answer may point the way to 
an affirmative answer for all d and thus also for Questions A' and A. A negative answer 
for d = 2 would apparently lead to a negative answer for all d > 2. While this in itself 
would not imply negative answers to Questions A or A' it could perhaps indicate some path 
towards such negative answers. 

Proof of Proposition [iZTPl We begin with the remark that, since d — 1, we can and will 
make use of the following convenient fact: 

Whenever A and B are cubes with a common endpoint and 
such that A C B, then B\A also coincides a.e. with a cube. 

Unfortunately this fact is not available when d > 1. 

Let t' be an arbitrary constant in (0, 1/2). Let Q be a closed interval Q = [a,b] and 
suppose that F + and F_ are disjoint subsets of Q which are each unions of finitely many 
closed intervals. (In fact our proof will only use the fact that F + and F_ are measurable and 
disjoint). Suppose furthermore that Q is r'- minimal, (i.e., that it is (r', Q, F + , F_)-minimal 
in the notation of Remark 110. 5p . It will be convenient to write G := Q\F + \F^. We may 
apply part (ii) of Lemma 110.71 to obtain that equality must hold in at least one of the two 
inequalities 

K F +) > r 'KG) and A(F_) > r'A(G). 
(Cf. (I10.24|h ) We will suppose, without loss of generality, that A(F_) = r'X(G). (The other 
case can be treated exactly analogously.) 

Suppose, in contradiction to what we seek to prove, that X(F + ) ^ X(F_). For this to 
happen we must have 

(10.37) X(F+) > t'X{G) . 

Let us define the functions u : [a, b] — > R and v : [a, b] — > R by 

u[t) = X ([a, t] n F_) - t'X ([a, t] n G) 

and 

v(t) — X ([t, b] n F-) — t'X (ft, b] n G) . 
Note that u{t) = v(t) = for t = a and for t = b. Furthermore 

(10.38) u(t) + v(t) = X{F_) - r'A(G) = for all t E [a, b] . 

Suppose that u(s) = for some s G (a,b). Then v(s) = 0. Since Q is the union of the 
two non overlapping intervals l\ := [a, s] and I2 '■= [s,b] it follows from (110. 37p that the 
inequality 

A (F+ n if) > t'X (G n if) 

must hold for at least one of the two values j = 1 and j = 2. Since we also have A (F_ Cilj) = 
t'X (G H Ij) for both these values of j, it follows that Ij is a good r'-subcube of Q for at 
least one value of j. This contradicts the r'-minimality of Q and shows that we must have 
u(s) for all s G (a,b). It follows by an analogous argument, or simply from (110. 38|) . 
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that v(s) ^ for all s G (a,b). Since both u and v are continuous functions, they cannot 
change sign on (a, b) and so one of them is strictly positive and the other is strictly negative 
on the whole interval (a, b) . 

Let us consider the first case, where u{t) > for all t G (a,b). In view of ( 110. 37[) the 
continuous function 

w(t) := A ([a, t] n F+) - r'A ([a, t] n G) 

is strictly positive for t — b and therefore for some choice (in fact infinitely many choices) 
of s G (a, b) sufficiently close to b we have w(s) > 0. This, together with the fact that 
u(s) > 0, implies that the interval [a, s] is a r'-good subcube of [a, b] for these values of 
s. This contradicts the supposition that [a, b] is r'-minimal. In the remaining case, where 
v(t) > for all t G (a, b), an analogous argument implies that [s, b] is a r'-good subcube for 
all values of s G (a, b) sufficiently close to a and thus also gives a contradiction. Since we 
have shown that, in all cases, the assumption that X(F + ) ^ A(F_) leads to a contradiction, 
our proof is complete. □ 

Proof of Proposition \1U.1(A Suppose then that Question B has an affirmative answer for 
all d G N and for at least one value of the constant r' G (0, 1/2), a value which does not 
depend on d. We will see that this implies that the hypothesis of Theorem 110.61 is always 
satisfied, in fact for a positive constant s which depends only on the given constant r'. 

Explicitly, suppose that Q is a cube in M. d and that F + and F_ are two disjoint subsets of 
Q which are finite unions of closed rectangles. Suppose furthermore that Q is a r'-minimal 
subcube of itself, i.e., a (r', Q, F + , F_)-minimal subcube in the terminology of Remark 110.51 
In order to be able to apply Theorem 1 1 . 6 1 we have to find a subcube W of Q which satisfies 
( 110. 22p for a value of s which does not depend on d. We will show that we can simply take 
W = Q. 

In view of the supposed affirmative answer to Question B, we have that X(F + ) = X(F_). 
This, together with the fact that Q is a good r'-subcube (of itself), implies that 

HQ) = X(F + ) + X(F.) + X(Q\F + \F.) 

= 2min {X(Q n F+), X(Q n F_)} + X(Q \ F + \ F_) 
> 2r'A(Q\F + \F_) + A(Q\F + \F_). 

The previous three lines imply that X(Q \ F + \ F_) < ^P+i HQ) an< i a ^ so 

2min{A(QnF+),A(gnF. 

which together give us that 

min{A(QnF + ),A(QnF_)} 



Thus the cube Q = W indeed satisfies ( 110. 22ft for 



)} = X(Q)-X{Q\F+\F.) 

= l(X(Q)-X(Q\F + \F_)) 

> -11 — }X(Q). 

~ 2 V 2r' + I J W; 
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This shows that an affirmative answer to Question B ensures the validity of the condition 
that is required in Theorem 110.61 to imply a positive answer to Question A' for the given 
constant r' and for s = 2 +i/ T ' • Therefore an affirmative answer to Question B also implies 
a positive answer to Question A for any r G (r', 1/2) and for the same value of s. □ 



11. Appendices 

The material in this section is quite standard and/or elementary. This section, or some 
parts of it, may be removed from future versions of the paper. 

11.1. Medians and the mean oscillation of a function. Throughout this subsection 
(Q, £, A) is an arbitrary measure space, E is a measurable subset of Q satisfying < 
X(E) < oo, and / is a measurable real function whose domain of definition contains E. If / 
is integrable we set /e = J E fd\. We will sometimes use the notation ) to mean 

the set f(x) G H}, where H is some subset of R. 

In essentially all our applications in this paper of the (standard) results of this subsec- 
tion, Q is R d or some Lebesgue measurable subset of M. d , and £ consists of all Lebesgue 
measurable subsets of Q , and A is <i-dimensional Lebesgue measure. 

Lemma 11.1. There exists at least one median of f on E, i.e., a number m which satisfies 
(11.1) A ({x G E : f(x) < m}) < ^A (E) and A ({x G E : f(x) > m}) < ^A (E) . 

_ — 

Proof. Let 

A = I a G R : A ({x G E : f\x) > a}) > ^A (E) 

B = I a G R : A ({x G E : f\x) < a}) > ^A (E) 

Both of these sets are non empty, by the expanding sequence theorem. The set A is an 
interval whose left endpoint is — oo. The set B is an interval whose right endpoint is +oo. 
It is also clear that A n B is empty. Thus the set M = M\A\B is non empty and is a 
bounded interval (which may also happen to be a single point) whose endpoints are sup A 
and inf B. Every number m G M must satisfy (111.11) . □ 

Lemma 11.2. The mean oscillation of f on E satisfies 
(H-2) 0{J,E) = ^-J E \f-m\d\ 

for each median m of f on E. 

Proof. Suppose that m is any median of / on E. This implies that 

(11.3) A ({x G E : f(x) < m}) < ^A (E) < A ({x G E : f(x) > m}) . 



and 
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We can perform the following calculation for each number c satisfying c < m. The transition 
from the third and fourth lines to the fifth and sixth lines uses (I11.3P and the fact that 
m — c > 0. 



\f — m\d\ 



E 



(m — f)d\+ / (f-m)dX 

/-!((-oo,m)) 7/-i([m,oo)) 



(c-f)d\+ / (f-c)dX 

/-!((-oo,m)) i/-i([m,oo)) 

+ (m - c)A(/- 1 ((-oo, m))) + (c - m)A(/" 1 ([m, cm))) 



< / (c-f)dX + (f-c)dX 

Jf-^i-oo,™)) 7/-i([m,oo)) 

+ (m - c)A(r 1 (K oo))) + (c - m)A(r 1 ([m, oo))) 



(c-/)dA+ / (/-c)dA + 0_ 

/-l((-oo,m)) 7/-i([m,oo)) 



< / \c-f\dX+ \f -c\dX= \f -c\dX. 

In the remaining case, i.e., for each c > m, we can apply the previous calculation to the 
function — /. Since — m is a median of — / on E and — c < — m, we obtain that 



-/ + m|dA< / |-/ + c|dA. 

E J E 

From these two cases we see that 

/ \f-m\dX< [ \f-c\dX 

J E JE 

for every c G R .We can now obtain (jll.2p by taking the infimum over all cGR. □ 
Lemma 11.3. The inequality 

(11-4) \f E -m\<0(f,E) 

holds for every median m of f on E. 

Proof. 

fE ~ m= X(E) J E ih ~ m)dX = X(E)J E if ~ m)dX ~X(E)J E lf ~ mldX = ° (/ ' E) 

and essentially the same argument shows that m — f E < 0(f,E). These two estimates of 
course give us f lll.4p . □ 
Let us now prove (12. 3|) . 
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|/(x)-m|dA(x) < — JJf(x)-f E \dX(x) 



1 



< 



\{E)j 
1 



(f(x)-f(y))d\(y) 



X(E)- 



KE) JE 

\f(x)-f(y)\d\(x)d\(y) 



dX(x) 



establishing the first two inequalities of ( 12.31) . For the remaining inequality we observe that 



< 



x(Ey 
i 

xJeJ 

2 



\f(x)-f(y)\dX(x)dX(y) 



ExE 



\f(x) — m\ dX(x)dX(y) + 



ExE 



x(sy 



\m - f(y)\dX(x)dX(y) 



ExE 



\f — m\ dX 



HE) JE 

which completes the proof of ( 12.31) . 

Lemma 11.4. Let mi and rri2 be medians of f on E with mi < m^. Then 

X ({x G E : mi < f(x) < m 2 }) = . 

Proof. The sets f^ 1 ((— oo,mi\) and / _1 ([m2, oc)) must each have A measure greater than 
or equal to ^X(E). So, since they are disjoint subsets of E, they must in fact both have 
measure equal to ^X(E). Thus f~ l ((m^mj)), the complement of their union in E, must 
have zero A measure. □ 



11.2. Implications between different forms of the John-Nirenberg inequality. As 

mentioned in Section [21 various papers present slightly different versions of the inequality 
(I2.13p . Let us explicitly show the connections between these versions. Although we have 
formulated some of these connections for the special case where the collection S of admissible 
subsets of D is taken to consist only of cubes, i.e, for £ = Q{D), the same connections 
clearly apply when £ is chosen to be any other collection of admissible subsets, even not 
necessarily all contained in D. 

We have to consider three kinds of "transition": 

• In some variants of (|2.13p the seminorm \\f\\ BM o(D,Q(D)) or \\f\\BMO(D,Q(D)) a PP ears in 
place of ||/||^ /C )( D q(d))- But, since these seminorms are equivalent to each other to within 
a factor of 2, an inequality using one of them obviously implies analogous inequalities using 
the others, of course sometimes with the constant b replaced by 6/2. 

• Sometimes the set {x G D : \f(x) — fo\ > «} may be replaced by the possibly larger 
set {x G D : \ f{x) — fo\ > en}. But, for a > 0, this of course does not change anything. 
The given inequality remains valid since the right hand side of the inequality is a continuous 
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function of a and 

\({xeD:\f(x)-f D \>a}) < X ({x G D : \f(x) - f D \ > a}) 

= lim A ({x G D : \f(x) - f D \ > a - 1/ra}) . 

n— s>oo 

This explanation cannot be applied for a = 0. But since we know (by the argument of 
Remark 12.41 or some obvious variant of it) that the constant B satisfies B > 1, the case 
a = is a triviality. An exactly analogous argument shows that the given inequality 
remains valid if we replace the set {x G D : \f(x) — m\ > a} by the possibly larger set 
{x G D : \f(x) — m\ > a}, when m is a median of / on D . 

• Sometimes the average fjj in the set {x G D : |/(x) — /d| > a} or in the set 

{xeD:\f(x)-f D \>a} 

may be replaced by a median m of / on D. We can invoke the following lemma to describe 
the implications of such a change. 

Lemma 11.5. Let D be some subset ofM. d and letS be some collection of admissible subsets 
E o/IR d . Suppose that N(f) denotes one of the three seminorms \2.J$ , \2. 5\) or \2. 6]) and 
that it is known that the inequality 

(11.5) A ({x G E : \f(x) - f E \ > a}) < BX(E) exp 

holds for some fixed E G £ and for all a > and for certain fixed positive constants b and 
B. Then B > 1 and the inequality 

X ({x G E : \ f{x) -m\> a}) < e b BX(E) exp 

holds for every median m of f on E and each a > 0. 
Conversely, suppose that it is known that the inequality 

(11.6) A ({x G E : \ f(x) -m\> a}) < BX(E) exp 

holds for some fixed E G S and for all a > and for certain fixed positive constants b and 
B and for some median m of f on E. Then B > 1 and the inequality 

(11.7) A ({x G E : \ f(x) -f E \ ><*})< e b BX(E) exp 
holds for every a > 0. 

Proof. If the inequality All .5j) or, respectively, the inequality All .6j) holds for all a > 0, then 
(cf. Remark I2.4[) the constant B necessarily satisfies B > 1, and, furthermore , as already 
explained above, it also follows that the same inequality (respectively ( 111.51) or ( 111.61) ) still 
holds when "> a" is replaced by "> a" in the definition of the set on the left hand side. 

In view of Lemma [11.31 and ( 12. 3 j) we have \ f E — m\ < N(f) for every median m of f on 
E. So, for each x G E, 

\f(x)-m\-N(f)<\f(x)-f E \ 
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and 

\f( x )-f E \-N(f)<\f(x)-m 
These inequalities imply, respectively, that 

(11.8) {x G E : -m\ > a} C 
and that 

(11.9) {x G E : |/(x) - f E \ > a} C 
If ([TT3j> holds for all a, then, by fflTH]) 

A ({x G E : \f(x) -m\> a}) < 

< 



If a G [0,N(f)), then 

A ({x G E : |/(z) -m\> a}) < X(E) = X(E)e b ■ e~ b < X(E)e b ■ exp 
and so, in all cases, 

A ({x G E : \f{x) -m\> a}) < e b B\(E) exp 

as required. 

The proof that f 1 1 1 . 6 j) implies (111.7P is exactly analogous, of course using f ll 1 .9j) in place 
of flTL8]) . □ 

11.3. Compositions of BMO functions with Lipschitz functions. We would not be 
at all surprised if the result (Proposition 1 11.7]) presented in this appendix is already known. 
But somehow we have not found a reference for it yet. The following completely obvious 
lemma and not very difficult proposition are not needed for obtaining the main results of 
this paper. But they may be of independent interest. They are also a kind of motivation 
for some of the steps for proving our main results. 

Lemma 11.6. Suppose that the function (p : M — > R satisfies the Lipschitz condition 
(11.10) \<p(s) - <p(t)\ < \s-t\ for alls,t£R. 

Then, for each function / G BMO(M. d ) the composed function if o / is also in BMO(R d ) 
and satisfies 

Wf ° f\\BMO(R d ) — \\f\\BMO(R d ) ■ 



{xEE:\f(x)-f E \>a-N(f)}. 



{xEE: \ f(x)-m\ >a-N(f)} . 
for all a > N(f), 

X({xEE:\f(x)-f E \>a-N(f)}) 



Be b X(E) exp 



ba \ 
WIS) ' 
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Proof. Let Q be an arbitrary cube in R d and let c be an arbitrary real constant. It follows 
immediately from (jll.lUp that 

\<pof-<p(c)\d\< / |/-c|dA 



which completes the proof. □ 

Note that in the following proposition we do not even need to require the function / 
to be locally integrable. One of its immediate consequences is that a measurable function 
/ : R d — y R is in BMO (Mr) if and only if the BMO seminorms of all the bounded functions 

fk( x ) = ^arctan ^^rpj are all dominated by a (finite) constant which is independent of 

fcGE The same conclusion follows when the bounded functions fa are defined instead by 

{—k , f(x) < -k 
f(x) , -k<f(x)<k 
k , f(x)>k. 

Proposition 11.7. Let {y2fc} fc6N be a sequence of non decreasing functions which each 
satisfy the Lipschitz condition U1.10\) . Suppose also that lim^oo (pk(t) = t for all real t 
and that, for each bounded interval (a, b) there exists an integer N such that ipk is strictly 
increasing on (a, b) for each integer k > N. 

For each integer k G N, let fa : M d — y R be defined by fa(x) = tp^(f(x)). Then f G 
BMO(M d ) if and only fa G BMO for all k G N and limsup fc6N ||/fc||^A/o ^ s fi n ^ e - ^ n f ac ^i 
for each f G BMO(M d ) , the sequence Wfk\\BMo\k&A conver 9 es t° a finite limit and 

UWbmo = , lim \\fk\\ B MO ■ 

k — ^oo 

Proof. In view of Lemma [TL~6l we have ||/|| BMO > lim sup^^ ||/fe|| BA fO wn i cn immediately 
establishes one of the implications and means that it remains only to prove that 

(11-11) UWbmo < \\M\bmo 

k— >oo 

whenever limsup^^ ||/fe|| BA/0 is finite. 

So let us indeed suppose that lim sup^^ ||/fc|| BMO is finite. 

Choose an arbitrary cube Q in R d . Suppose that c is a median of / on Q. Let us choose 
an integer N such that, for each k > N, the non decreasing function ipk is strictly increasing 
on the interval (c — 1, c+ 1). Then ipf.(f(x)) > fk{c) if and only if f(x) > c, and, similarly, 
<Pk(f(%)) < V 9 fc( c ) if an d only if f(x) < c. It follows that := y?fc(c) is a median of fa on Q. 
We also have lim^oo c& = c. For each fixed positive number M we can use the dominated 
convergence theorem to obtain that 



Jc 



Qn{x:-M<f(x)<M} k ^°° J Qn{x:-M<f(x)<M} 



\f-c\d\ = lim / \fk-c k \d\ 

k ^°° J Qn{x:-M<f(x)<M} 

lim inf / | fa — Ck \ d\ 

k ^°° jQn{x:-M<f(x)<M\ 



' Qn{x:-M<f(x)<M} 

< liminf / \fa - c k \ d\ 
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BMO 



In view of (12. 2ft (cf. Lemma [11 .2ft . this last expression is dominated by 

A(Q)liminf \\f k \\ BMO . 

So, by applying the monotone convergence theorem, we deduce that 

f \f- c \d\= lim f \f-c\dX< A(Q)liminf ||/ fc | 

JQ M ^°° JQn{x:-M<f(x)<M} k ^°° 

which shows that / G BMO and establishes (111. lip , so completing the proof. □ 

Remark 11.8. To obtain the results mentioned in the preamble to the preceding proposition 
we of course simply choose ip^ defined by ipk(t) = /carctan (|) or 

—k , t < —k 
Mi) = < f( x ) > -k <t < k 



k , t > k 



respectively. 



11.4. The boundary of a convex set in It is surely very very well known that the 
boundary dK of a convex K set in R d satisfies A (dK) =0. But let us give an explicit 
proof of this in this preliminary "lecture notes" version of the paper. By translating K 
if necessary, we can assume that the origin = (0,0, ...,0) of M. d is in K. We will also 
temporarily assume that K is bounded. If \(K) = there is nothing to prove. This means 
that we can assume that K is not contained in any d — 1 dimensional subspace of M d . 
Consequently K contains the convex hull of and d more points whose "span" is W 1 . It 
follows that K must contain a cube Q. Since we are permitted to translate K yet again if 
necessary, we may now assume that the centre of Q is at 0. For each r G (0, 1) and each 
x G K the set rx + (1 — r)Q is contained in K. This means that the convex set rK is 
contained in the interior K° of K. Thus A (K°) > r d X(K) for every r G (0, 1). Consequently 
X(K°) = X(K) which of course implies that A (dK) = 0. 

Finally, if K is unbounded, we can set K n = {x G K : ||x|| < n} and use the fact that 
dKc\} n& dK n . 

11.5. The functional 3(f,E,s) "commutes" with affine transformations. Here is 
the straightforward calculation which proves (14. 4p and therefore also (14. 5 p for the function 
g defined by g(x) = f(rx + Xo) where r is a non zero real number and xq is a constant point 
in M. d and E is an admissible set in the domain of definition of g. 

X ({x G E : \g(x) — c\ > a}) = X ({x G E : \f(rx + x ) — c\> a}) 

= X ({rx G rE : \ f(rx + Xq) — c\ > a}) 

= X ( ~{y G rE : \f(y + x ) - c\ > a} 
\r 

= r~ d X({y ErE :\f(y + x ) - c| > a}) 
= r~ d X ({y + x G rE + x : \f(y + x ) - c| > a}) 
= r~ d X ({x ErE + x : \ f(x) — c\ > a} — Xq) 
= r~ d X ({x ErE + x : \ f(x) — c\ > a}) . 



62 



MICHAEL CWIKEL, YORAM SAGHER AND PAVEL SHVARTSMAN 



This calculation, together with the fact that X(rE + Xq) = r d X(E), shows that the 
condition 



is equivalent to 

A ({x ErE + x : \ f(x) — c\ > a}) < s\(rE + x ) 

which is exactly what we need to show that 3(g, E, s) = J(f, rE + x^, s) for each admissible 
set E. 

11.6. Left continuity and right discontinuity of the function s i— > 3(f,Q,s). 

Lemma 11.9. Suppose that Q and f are as in part (ii) of Proposition \J75\ Then the 
function s h-> J(/, Q, s) is non increasing and left continuous. 

Proof. Since we know that s i— > J(f,Q,s) is non increasing, in order to show that this 
function is also left continuous, we only need to prove that 



for each fixed s G (0, 1). As in the proof of Proposition 14. 5[ let g : Q — > R denote the 
function which is the restriction of / to Q. Fix an arbitrary positive number e. In view of 
(14.1 ip there exists u G (0, sX{Q)) such that 



Of course we have u < (s — l/n)X(Q) for all sufficiently large n. Furthermore, because of 
the right continuity of g*, the inequality 



also holds for all sufficiently large n. These last two inequalities imply, again in view of 
(14. lip , that 3(f, Q,s — 1/n) < J(f, Q,s) + e for all sufficiently large n, and this suffices to 
complete the proof. □ 

Example 11.10. Suppose that d = 1 and Q = (0, 1) and a G (0, 1). Let N be the smallest 
integer for which Na > 1 and let / be the restriction to (0, 1) of the function 

NX(0,a) + (N - l)X[o,2o) + (N - 2)x[2o,3o)- + X[(N-l)a,Na) ■ 

Then / is non increasing and right continuous and /* = g* — f on (0, 1). Consequently, by 
(gUD , J(/, (0, 1), s) > 1/2 for s G (0, 1 - a] and J(/, (0, 1), s) = for s G (1 - a, 1). This 
shows that, in general, s i— > J(f, Q, s) is not right continuous. 

11.7. The proof of Lemma 110. li For part (i), let y be an arbitrary point of Q (x* , r* ) . 

For each n G N we set y n = x n + ^(y — x*) . Since \\y — x*||^oo < r* it follows that 
Un ^ Q{%n, r n)- Therefore y n G Q. Since Q is closed it follows that y = lim^oo y n is also 
in Q and this shows that Q(x*,r*) C Q. 
For part (ii) we observe that 



A ({x G E : \g(x) - c\ > a}) < sX(E) 



lim J (/, Q, s 



1/n) < 3(f,Q,s 



g*(u) -g*(u+{l- s)A(Q)) < J(/, Q, s) + e/2 . 



g* (u) -g*(u+(l-s+ l/n)X{Q)) < 3(f, Q,s)+e 



(11.12) 




XAdX - J c 



XAdX 
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Let R n = \\x n — x*\\(°o + max {r*, r„}. It is clear (from (110. ip ) that the cube Q(x*,R n ) 

d 

contains both of the cubes Q(x*, r*) and Q(x n , r n ). Therefore the expression in the last line 
of (I11.12p is dominated by 

A (<2(x*, R n ) \ Q(x*,n)) + A (Q(x*, R n ) \ Q(x n , r n )) 
= R d n -rt + R d n -r d n . 

This last expression tends to as n tends to oo. This establishes (1 1 . 2 [) and so completes 
the proof of the lemma. □ 

REFERENCES 

[1] R. J. Bagby and D. S. Kurtz, Covering lemmas and the sharp function, Proc. Amer. Math. Soc. 93 
(1985), 291-296. 

[2] R. J. Bagby and D. S. Kurtz, A rearranged good-A inequality, Trans. Amer. Math. Soc. 293 (1986), 
71-81. 

[3] C. Bennett, R. A. DeVore and R. Sharpley, Weak-L°° and BMO, Annals of Math., 113 (1981), 601-611. 
[4] C. Bennett and R. Sharpley, Weak-type inequalities for H p and BMO, Proc. Sympos. Pure Math. 35 
(I) (1979), 201-229. 

[5] J. Bergh and J. Lofstrom, Interpolation spaces. An Introduction, Springer, Berlin 1976. 

[6] C. Fefferman and E. M. Stein, H p spaces of several variables, Acta Math. 129 (1972), 137-193. 

[7] A. M. Garsia and E. Rodemich, Monotonicity of certain functionals under rearrangements, 
Ann. Inst. Fourier, Grenoble, 24 (1974), 67-116. 

[8] R. A. Hunt, On L (p, q) spaces, L'Enseignement Math. 12 (1966), 249-276. 

Since this journal may not be available in some mathematics department libraries we refer to the site: 
http : //retro . seals . ch/cntmng?type=pdf &rid=ensmat-001 : 1966 : 12 : : 91&subp=hires 

[9] B. Jawerth and A. Torchinsky, Local sharp maximal functions, J. Approx. Th. 43 (1985), 231-270. 
[10] F. John, Rotation and strain, Comm. Pure Appl. Math. 14 (1961), 391-413. 

[11] F. John, Quasi isometric mappings, Semineri 1962-1963 di Analisi Algebra, Geometria e Topologia, 
Vol II, Rome, 1965. 

[12] F. John and L. Nirenberg, On functions of bounded mean oscillation, Comm. Pure Appl. Math. 14 
(1961), 415-426. 

[13] I. Klemes, A mean oscillation inequality, Proc. Amer. Math Soc. 93 (1985), 497-500. 

[14] A. A. Korenovskii, The connection between mean oscillations and exact exponents of summability 

of functions, Mat. Sb. 181 (1990), no. 12, 1721-1727 (Russian); translation in Math. USSR-Sb. 71 

(1992), no. 2, 561-567. 

[15] A. A. Korenovskii, Mean oscillations and equimeasurable rearrangements of functions, Lecture Notes 
of the Unione Matematica Italiana, 4. Springer, Berlin; UMI, Bologna, 2007. 

[16] A. K. Lerner, Maximal functions with respect to differential bases measuring mean oscillation, Anal. 
Math. 24 (1998), 41-58. 

[17] J. Moser, On Harnack's Theorem for Elliptic Differential Equations, Comm. Pure Appl. Math. 14 
(1961), 577-591. 

[18] F. Nazarov, M. Sodin and A. Vol'berg, The geometric Kannan-Lovasz-Simonovits lemma, dimension- 
free estimates for volumes of sublevel sets of polynomials, and distribution of zeroes of random analytic 
functions, arXiv : math/0 1082 12v2 [math.CA] 

[19] F. Nazarov, M. Sodin and A. Vol'berg, The geometric Kannan-Lovasz-Simonovits lemma, dimension- 
free estimates for the distribution of the values of polynomials, and the distribution of the zeros of 
random analytic functions, Algebra i Analiz 14 (2002), no. 2, 214-234 (Russian); translation in St. 
Petersburg Math. J. 14 (2003), no. 2, 351-366 

[20] Y. Sagher, Lecture Notes on Measure Theory and Real Analysis. (In preparation). 

[21] Y. Sagher and P. Shvartsman, On the John-Stromberg- Torchinsky characterization of BMO, J. Fourier 
Anal. Appl. 4 (1998), 521-548. 



64 



MICHAEL CWIKEL, YORAM SAGHER AND PAVEL SHVARTSMAN 



[22] Y. Sagher and P. Shvartsman, Rearrangement-Function Inequalities and Interpolation Theory, J. Ap- 

prox. Theory 119 (2002), 214-251. 
[23] P. Shvartsman, The if-functional of the pair (Loo(w), BMO), Function spaces, interpolation spaces, 

and related topics (Haifa, 1995), 183-203, Israel Math. Conf. Proc, 13, Bar-Ilan Univ., Ramat Gan, 

1999. 

[24] E. M. Stein, Harmonic analysis: real-variable methods, orthogonality, and oscillatory integrals. With 

the assistance of T. S. Murphy. Princeton Mathematical Series, 43. Monographs in Harmonic Analysis, 

III. Princeton University Press, Princeton, NJ, 1993. xiv+695 pp. 
[25] J.-O. Stromberg, Bounded mean oscillation with Orlicz norms and duality of Hardy spaces, Indiana 

Math. J. 28 (1979), 511-544. 
[26] R. Wheeden and A. Zygmund, Measure and integral. An introduction to real analysis. Pure and Applied 

Mathematics, Vol. 43. Marcel Dekker, Inc., New York-Basel, 1977. 
[27] I. Wik, On John and Nirenberg's theorem, Ark. Mat, 28 (1990), 193-200. 

Cwikel and Shvartsman: Department of Mathematics, Technion - Israel Institute of 
Technology, Haifa 32000, Israel 

Sagher: Department of Mathematics, Florida Atlantic University, 777 Glades Road, 
Boca Raton, FL 33431, USA 

E-mail address: mcwikel@math.technion.ac.il 
E-mail address: yoram . sagherOgmail . com 
E-mail address: pshv@tx.technion.ac.il 



